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. In the first part of this paper, we provide polynomial quantum algorithms for additive approxima- 
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tions of the Tutte polynomial, at any point in the Tutte plane, for any planar graph. This includes an 



■ additive approximation of the partition function of the Potts model for any weighted planer graph at 
■ 

' any temperature, as well as approximations to many other combinatorial graph properties described 

' by the (multivariate or not) Tutte polynomial. 

To achieve these algorithms, we generalize the Temperley Lieb algebra representations, used in 

-I— > ' 

^ ' [6], to apply for any graph (not necessarily coming from a braid). Moreover, our representations 

a ■ 

r- { ' are non-unitary, as are all representations of the Temperley Lieb algebra not corresponding to Jones 

^ '' polynomial related parameters. It might seem at first sight that this makes it impossible to apply 

them by a quantum circuit. We show how to do this nevertheless. The approximation window size 
turns out to be inverse polynomial in \G\ times the product of the norms of the operators we apply. 
I Additive approximations are tricky; the range of the possible outcomes, might be smaller than the 

size of the approximation window, in which case the outcome is meaningless. Unfortunately, ruling 
out this possibility is difficult; If we want to argue that our algorithms are meaningful, we have to 
provide an estimate of the scale of the problem, which is difficult here exactly because no efficient 
algorithm for the problem exists! 

In the second part of the paper we provide an indirect but very convincing proof that our approx- 
imation is meaningful for a large range of parameters, by showing that in those cases, the problems 
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our algorithms solve are complete for quantum polynomial time. We thus get a large class of new 
BQP-complete problems. This result is particularly interesting in the case where the relevant repre- 
sentations are non-unitary, since in this case even the notion of universality seems counter intuitive. 
Indeed, the universality proof is extremely involved technically, and requires many new innovative 
ideas. 

The case of the Potts model parameters deserves special attention. Unfortunately, despite being 
able to handle non-unitary representations, our methods of proving universality seem to be non- 
applicable for the physical Potts model parameters. We can provide only weak evidence that our 
algorithms are non-trivial in this case, by analyzing their performance for instances for which classical 
efficient algorithms exist. The characterization of the quality of the algorithm for the Potts parameters 
is thus left as an important open problem. 

To summarize, the main progress in this work is in our ability to handle non-unitary repre- 
sentations; by doing so, we provide many new quantum complete problems corresponding to the 
approximation of the Tutte polynomial at various points. We believe this work, and in particular, the 
progress we make in handling both algorithmically, as well as from the universality point of view, the 
non-unitary representations, is an important step towards understanding the quantum complexity of 
the Potts model problem. This is the first non-trivial progress on this problem since it was posed, 
more than a decade ago, as a challenge to quantum algorithms. Many open problems are raised 
by this work, other than the clarification of the complexity of the Potts parameters. A particularly 
interesting one is finding other scenarios where (non-unitary) representations of algebras can be used 
to derive efficient quantum algorithms for combinatorial problems. 

1 Introduction 

In the search of the past deeade for fast quantum algorithms for problems for which no efficient classical 
algorithm is known, several problems have been marked as good candidates. The most commonly known 
one is the graph isomorphism problem, but much effort was devoted also to lattice problems, such as the 
shortest and closest vector in a lattice problem, and to other problems. Unfortunately, no progress was 
made in any of these questions on the algorithmic front (but see interesting complexity theoretic results 

mm) 

One of the problems that attracted the scientific community effort was that of the approximation of 
the Potts model partition function. The Potts model (also known as the g-state Potts model) is a famous 
model originating from statistical physics |25j , which is a generalization of the Ising model [27j to more 
than two components. It was proposed by C. Domb and his then research student R. B. Potts almost 
five decades ago and has since become a rich and active area of statistical physics. 



2 



One considers an edge- weighted graph (in this paper we consider only planar graphs), the nodes of 
which can be colored in one out of q colors. Each color configuration of the nodes, cr, is given an energy 
H{a) which depends on how many edges in the graph are monochromatic when colored by cr, and on 
the weights of the edges. The probability for the system to be in a given configuration is determined by 
the Boltzmann-Gibbs distribution, e~^(°')/('''i'i") where T is the temperature and fc^ is the Boltzmann 
constant. This model captures the essentials of many physical systems related to solid states, and 
physicists and mathematicians arc therefore very interested in the properties of the model . 

It turns out that almost all important properties of the Potts model, as well as other statistical 
physical systems, can be derived from a certain quantity called the partition Junction of the system. This 
is given by a sum running over all possible colorings of the graph, of the Boltzmann-Gibbs distribution 
(which is unnormalized) . The problem of evaluating, or even approximating, the Potts model partition 
function, has become a very important problem in mathematical physics. 

The Potts model partition function is well known to be a special case of the Multivariate Tutte 
Polynomial of a graph [301. This is a polynomial that can be defined for every edge- weighted graph 
G = {V,E) and an additional variable q that is related to the number of colors in q of the Potts model. 
It is conventionally denoted by Zc{q,'v) with v = {ue|e e E} being the set of weights of all edges. The 
multivariate Tuttc polynomial is a generalization to more than two variables of the well-known Tutte 
polynomial [31j . The latter captures an extremely wide range of interesting combinatorial properties 
of graphs. It also generalizes the Jones polynomial of alternating links, the reliability of a network, 
the number of spanning trees, and more. As in this paper we will mainly work with the multivariate 
Tutte polynomial, we will occasionally refer to it simply as the "Tutte polynomial" . The original Tutte 
polynomial will be referred as the "standard" Tutte polynomial. 

The exact evaluation of the standard Tutte polynomial for planar graphs turns out to be hard at 
all but several trivial points [35j . But what about approximations? Researchers have devoted much effort 
to the attempts of providing approximating algorithms for the Tutte polynomial at various points, and in 
particular, for the Potts model partition function. One is ultimately interested in providing an FPRAS 
(fully polynomial randomized approximation scheme), which gives as good an approximation as one 
desires with polynomial overhead. The most common approach for this matter is that of the celebrated 
Markov chain Monte Carlo method [32l[33j. For high temperatures, it is known that this approach works 
for the ferromagnetic case (where the edge weights are positive), and so an FPRAS exists . On the other 
hand, it is known that an FPRAS is NP hard to achieve for the anti- ferromagnetic case and in 
fact, this hardness result holds for about three quarters of the Tutte plane, so to speak. The question 
of achieving an approximation for the ferromagnetic case at various points, as well as for other points in 
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the Tutte plane for which an FPRAS might still be possible, is an extremely important question. We are 
not aware of any complexity theoretic restrictions that would imply such an approximation unlikely; see 
Ref. [31. 

1.1 Results 

Our first result is an additive, rather than multiplicative, quantum approximation algorithms for the 
standard Tutte plane at any point, including all points corresponding to the Potts model at any number 
of colors and any temperature, and any set of weights on the edges. Roughly speaking, an FPRAS 
algorithm approximates the quantity X by a number within the range [X — X/poly{n), X + X/poly{n)], 
with n being the order parameter of the problem. On the other hand, our additive approximation 
scheme provides an approximation within the range [X — A/poly{n), X + A/poly{n)], with A being some 
parameter that can be easily calculated from the input. We call A the approximation scale of the problem. 
Obviously, if A = 0{X) then our scheme is equivalent to FPRAS - but this is usually unknown. 

Theorem 1.1 (Quantum Algorithm, rough version) There exist an efficient quantum mechanical 
algorithm for the following problem. The input is a planar graph, with ( complex) weights on the edges, and 
a (complex) number q. The output is an additive approximation of the (multivariate) Tutte polynomial 
of the graph with those weights. 

The approximation scale of the algorithm is not specified in this rough version of the theorem; it 
requires some prior definitions before we can state it, and so it will be given later on. 

The size of the approximation scale is crucial when one considers additive approximations. This is 
because one has to be convinced that the size of the window is not exponentially larger than the scale of 
the problem - otherwise the problem the algorithm solves is trivial. Unfortunately, it is quite difficult to 
give bounds on the scale of the problem given the hardness to approximate it. 

To show that in many cases our algorithms do not fall into the trivial category, we would like to show 
that the problems the algorithms solve arc hard. Wc start by proving: 

Theorem 1.2 (BQP hardness, rough version) There exists a wide range of complex weights and 
complex values of q, for which the additive approximation of the multivariate Tutte polynomial to within 
a certain scale, is ^QP -hard. 

As in the previous theorem, the exact size of the approximation window and the precise definition of the 
complex weights and q for which the theorem holds, will be given later on. Examples for such parameters 
are 
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• g = 3, and the weights js {e'^'/'^ - l) \ 3 (l - e-^'/^y^ , e"/^ - 1, 1 - e-'^'/^j ^ 

• g = 2i, and the weights {100, -2i- 100, 1,-1/2} , 

• g = 3, and the weights {100, -103, 1,-1/2} . 

We shaU see later that these three examples actually fall into three different classes of weights that we 
call unitary weights, non-unitary complex weights and non-unitary real weights. Their names imply the 
type of operators that they define. 

The proof of universality of the parameters which correspond to unitary operators, includes as a 
special case the universality proof of the approximation of the Jones polynomial [SI [51 [J . Indeed, the 
proof of the unitary case follows closely the proof of [7|. The result, however, applies to many more 
parameters than the Jones related corresponding result |7j. Essentially, the generalization comes from 
the following fact. An edge weight in our construction corresponds to two different matrices, depending 
on the orientation of the edge. In the Jones polynomial case, those two matrices need to be unitary. When 
we consider the multivariate Tutte polynomial, we can use two different edge weights, and therefore we 
can relax this restriction and require that only one of the matrices corresponding to each of these weights 
is unitary. This gives many more parameters for which the result holds. This will be explained later on. 

Proving BQP hardness for the complex non-unitary case requires much more extra work, compared 
to the unitary case. In fact, even the very notion of universality when the generators are non unitary 
seems counter intuitive, and it is not clear how to even start. The proof follows very roughly the same 
outline of [7], but we need to develop many new tools that enable us to deal with the non-unitarity of 
the operators; we will elaborate on that later on. The proof for the case of real non-unitary matrices 
requires yet another separate treatment since now universality is interpreted as density and efficiency in 
the orthogonal group rather than the unitary group, which poses more obstacles. 

Comparing Theorem 1 1 . 1 1 and Theorem 1 1.2[ we see that we get complementary results: an algorithm 
(for unrestricted parameters) and a hardness result (for a restricted set of parameters). We would like 
to deduce that the problems for which both results hold, are BQP complete. However, for this to hold, 
we need the approximation windows in both results to match. It turns out that for the parameters of 
the unitary case, this happens without any extra effort. Therefore the unitary set of weights provides us 
with a wealth of new BQP-complete problems. 

For the non-unitary cases, this is not automatically true; due to reasons that we will see later on, 
the approximation scale of the algorithm is larger than what is required for universality, and thus, the 
algorithm solves an easier problem. To match the two problems, we modify the definition of the problem 
in a somewhat artificial way. but that does not seem to damage the main point of the result. In the newly 
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defined problem, the input is a pair of a planar graph, together with a partition of its edges into groups. 
The approximation scale is then defined using this partition, and it is for this problem that we are able 
to show completeness. 

Theorem 1.3 (Completeness, rough version) The problem of approximating the Tutte polynomial 
of a given planar graph with a given partition of its edges, to within an additive approximation scale which 
is defined by the input graph and partition, is BQP complete. 

Consequently, we get a wide range of new BQP-complete problems, corresponding to unitary and 
non-unitary parameters of the Tutte polynomial. 

We remark that unfortunately, our universality proof docs not hold for the Potts model parameters, 
(regardless of the size of the approximation window.) This raises the question, of whether our algorithms 
perform any non-trivial task in this range of parameters. We provide very weak evidence for non-triviality, 
by analyzing the performance of the algorithm in the case of a family of graphs for which we can easily 
calculate the Potts model: the line, with various edge weights. We see that the algorithm's approximation 
window is such that it distinguishes, with high probability, between the Potts partition functions for the 
graphs with different weights. Of course, for those graphs the exact Potts model can be calculated exactly 
efhciently, and so this cannot be regarded as a proof of non-triviality. The very interesting open question 
of characterizing the quantum complexity of the physical Potts problem is left open for future research. 

We proceed to outline the main new ideas introduced in this work. 

1.2 Main Ideas in the Algorithm 

The work is divided into two unequal parts: the algorithm and the universality proof. We start with the 
main ideas underlying the algorithm. 

In the work of Aharonov, Jones and Landau regarding the Jones polynomial [S], a special case of 
our current problem was solved. One can view the solution there as follows: One considers a braid, 
and represents it in an algebra spanned by pictures similar to braids but with no crossings, like those in 
SectionUl This algebra is called the Temperley Lieb algebra (see Ref [13] and §12.4 in [H]), and is denoted 
by TLn{d), where n is the number of strands in the braid. It turns out that the Jones polynomial is in 
fact equal to a certain function called the Markov trace of the Temperley Lieb element that corresponds 
to the given braid, and this trace function satisfies a certain property called the Markov property. To 
compute this trace, the idea is to use the following fact: If a matrix representation of the TLn{d) algebra 
can be assigned a weighted trace which satisfies the Markov property, then it will be equal to the Markov 
trace of the corresponding Temperley Lieb elements. It thus suffices to compute this weighted trace of 
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the matrices by a quantum computer. Fortunately, there are known unitary representations of braids, 
induced by the so called path representation of the Temper ley Lieb algebra. Thus, the computation can 
be performed quite easily by a quantum computer: for each crossing in the braid, the algorithm applies 
the corresponding unitary matrix. The weighted trace of the overall unitary matrix is easy to estimate 
using standard techniques in quantum computation. 

We will build on the above method. We start by generalizing the Temperley Lieb algebra that is used 
in [6] to an infinite algebra, which we denote by GTL(d), where the number of strands is not fixed; in the 
physics language, we allow creation and annihilation operators. This allows us to handle any graph, and 
not just graphs originating from braids. What more, it allows us to relax the requirement of the Markov 
property, and use any representation! This is because miraculously, once we deal with representations of 
the more general algebra, it is no longer the Markov trace of the Temperley Lieb element that corresponds 
to the Tutte polynomial, but simply the overall factor multiplying the identity element of the GTL{d) 
algebra. 

We are thus looking for ways to approximate this norm efficiently using a quantum computer. The 
next step is to apply the matrices from the representation by a quantum computer. The most important 
apparent obstacle here is that it seems that one is restricted to use unitary representations of the Tem- 
perley Lieb algebra if one is to apply it by a quantum computer. It turns out, however, that this is not at 
all a necessary restriction. We find a way to apply the representation we use even when it is non-unitary. 
Our only restriction is that the norm of the operators we apply needs to be 1; if it is not, we divide the 
whole operator by its norm, and book keep the extra factor. We lose in the approximation scale a factor 
which is exactly this norm. 

This gives an efficient quantum algorithm that approximates the Tutte polynomial to within a given 
additive approximation, where the scale of the approximation is an inverse polynomial times the product 
of the norms of the operators it applies. 

We can perhaps understand better now the notion of additive approximation in this context. The 
norm of the overall product of operators is what we are looking for: it is equal to the value of the Tutte 
polynomial. Clearly, this norm, the norm of the product of operators, is smaller than the product of 
the norms. It is this latter bigger quantity, the product of norms, that we view as the scale of the 
approximation. Our algorithm approximates the overall norm, to within inverse polynomial times the 
scale. 
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1.3 Main ideas in the proof of universality 

We next show that finding an additive approximation of the Tutte polynomial, to within a certain 
approximation scale is as hard as quantum computation, for many weights and q^s. Note that we first 
show hardness of approximation to within a smaller scale than our algorithms provide; that is, we show 
universality of a slightly more difficult problem, with a smaller scale. This leaves a gap between the 
performance of the algorithm and the hardness result for most cases; We resolve this issue later on. 

As a first step, we would like to show universality for a set of parameters which correspond to the 
case in which the operators being applied are unitary (we include in this case also operators which are a 
scalar times a unitary). The universality proof itself follows quite closely the proof of universality of [7j, 
except for issues related to the book keeping of overall factors. 

The proof is significantly harder when we move to the non-unitary case, where we have to overcome 
a true difficulty: The matrices we are supposed to use in order to express any quantum circuit, are non 
unitary! In universality proofs, one usually shows density in the unitary group, and then uses the Solovay- 
Kitaev theorem that shows that density implies efiiciency. In our case, both seemed to be impossible; 
working with general linear operators, which apply stretching and shrinking, how can one show density in 
the unitary group? And moreover, even if density applies, how can one use the Solovay Kitaev theorem? 
To explain how we overcome these difficulties, let us first recall the main ideas in the proof of [7]. 

The main idea is to encode the n qubit Hilbert space into the space of paths on finite graphs the shape 
of a line. Then using the path representation, elements of the TLn{d) algebras are mapped to operators 
on that space. The paths are encoded by strings in which means a step to the left and 1 means a step 
to the right. This is not a tensor product space, but one can easily encode a tensor product space into 
it; A path of length 4 of the form 0101 corresponds to the state |0), and a path of length 4 of the form 
1100 corresponds to the state |1). Thus, paths of An steps include the encoding of n qubits. Two qubits 
gates are encoded as transformations that work on 8 steps paths. We note that the 8 steps paths include 
many paths that are not one of the 4 legitimate 8 steps paths that correspond to encoded two qubits. 
What more, as wc will sec, those four legitimate paths are not an invariant subspace, and so density will 
have to be proven in the smallest invariant subspace that contains them - which is the subspace of all 8 
steps paths that start and end at the first vertex of the graph. There are 14 such paths. To approximate 
two qubit gates, we will thus have to approximate a matrix in SU{IA). 

In [7], the first building block is to show density in SU{2), on two of those 14 paths. This is called 
the seed; It is pretty easy to establish. Then one can use various tools to build up the dimensionality of 
the dense subgroup, to get to the 14 relevant dimensions, in which all two qubit gates are encoded; one 
main tool is the Bridge lemma [7] which states that given matrices which generate dense subgroups on 
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two orthogonal subspaces, and a matrix mixing those subspaces up (the "bridge"), the resulting group 
is dense on the direct sum of the two spaces. Another tool is the Decoupling lemma: if we have density 
on two unitary groups on two orthogonal subspaces of different dimensions, then even though these two 
generated groups might in principle be coupled or correlated, they are in fact decoupled and we can 
continue as if we have universality on each one of them separately. 

Let us now tentatively explain how one modifies this proof to handle non-unitary parameters. We 
start with the first building block, namely, constructing density on a two dimensional subspace. Unlike in 
the unitary case, here we are led to prove density in the special linear group SL{2, R) or SL{2, (D) instead 
of SU{2). In the quantum literature, as far as we know, density of non- unitary operators was not dealt 
with before; in fact, the theory here is very different than that in the unitary case. Fortunately one can 
use a wealth of results from theory of complex Mobius transformations for that matter. In particular, we 
use Jorgensen's inequality [35] to find sets of parameters for which density holds. We then reprove the 
Bridge lemma and the Decoupling lemma for the non-unitary case; once again there are some technical 
issues which one has to deal with when considering non-unitary matrices. Finally, we need to reprove the 
Solovay Kitaev theorem. In this theorem, much technical effort is given to the issue of accumulated errors. 
Now we have new errors to deal with since our matrices are only close to, but not inside, the unitary 
group. For that matter, we generalize the theorem for the non-unitary groups SLn{<C) and 5i,i(R). 

We get that any two-qubit gate can be replaced by polylogarithmically many GTL{d) elements, which 
are mapped under the path representation to an operator that acts on 8-steps paths. 

The final result is a mapping of the quantum circuit to a planar graph, such that the result of the 
circuit can be read from an appropriate approximation of the Tutte polynomial of the graph at the 
relevant parameters. 

1.4 Proof of BQP Completeness 

In the hardness result, the approximation scale turns out to be the same as the algorithmic scale only in 
the unitary parameters case. In this case, completeness is already proved. 

To match the approximation scales of the algorithm and the universality results also in the non-unitary 
cases, we note that our algorithm in the non-unitary case, was quite wasteful in its approximation quality. 
Recall that the size of the approximation scale is the product of the norms of the matrices being applied. 
However, note that the norm of the product of operators might be significantly smaller than the product 
of the norms, i.e., the scale of the problem. It is thus beneficial to apply the operators in groups, and not 
one by one. This can be done easily if the operators we would like to apply in one group, all operate on 
a small dimensional space in a sequence. In this case, before applying them, we calculate their product 
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by a classical computer on the side, and apply the resulting product by the quantum computer. 

In the context of the graph described in before, corresponding to the quantum circuit, there is a very 
natural grouping: Each gate in the circuit was replaced by polylogarithmically many operators, which 
we would like to view as one group. If we could group all operators corresponding to one quantum gate 
together, the resulting operator, which is approximately a unitary operator over 8-steps paths, has norm 
approximately 1, and thus does not increase the approximation scale at all. We would have liked to do 
this, rather than to apply the approximating sequence of gates one by one, which may contribute a lot 
to the approximation window. 

Unfortunately, to be able to achieve this improved approximation scale, the quantum algorithm needs 
to first recognize all operators coming from the same gate as belonging to one group. More generally, 
the algorithm would need to find a good way to partition the operators. So far we have not been able 
to find an efficient grouping algorithm that would achieve this goal. We are thus led to a less elegant 
solution: we modify the problem slightly. Our input is now a pair of a graph together with a partition 
of its edges to groups. The output is the Tutte polynomial of the graph, to within the approximation 
window defined by the given grouping. When this is done this way, the two approximation scales match, 
and we get both BQP hardness as well as an algorithm. 

The proof that this works is in fact quite involved, due to a subtle point in the above argument, which 
causes a lot of trouble in the proof. Note that the idea above relied heavily on the fact that the norm of 
the product of all operators in one group is 1 (or very close to 1). However, the operators we consider 
operate on the direct sum of many subspaces (denoted A^^i), whereas only the first subspace, Ai_i, is 
relevant for the quantum computation. It could be that the approximation is unitary in the relevant 
subspace, whereas in the other subspaces the resulting product of operators has a much larger norm; the 
quantum algorithm only guarantees approximation to within this (possibly too large) norm. To really 
match the sizes of the approximation windows, we have to somehow address this issue. 

Our solution is as follows. We make sure that the norm of each group of operators is indeed approxi- 
mately 1, even when the entire space is considered, by proving density and efficiency in the unitary group 
simultaneously on all those subspaces. This requires some work since we need to deal with the additional 
subspaces one by one. Moreover, we need to generalize the Solovay Kitaev to hold also for direct sums of 
unitary groups, and we can only show this for a finite sum. Unfortunately, all the above manipulations 
imply that we have to limit the parameters with which we work, and so the set of points for which we 
get completeness seems smaller than that for which hardness holds. 

The above seems like a very complicated solution to a rather minor and technical issue; we speculate 
that a better solution to this technical point exists, which would enable to handle it in a more elegant 
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way. 

The final result is that for a large class of parameters, we can show that the problems with the 
same approximation scale is both doable in quantum polynomial time and also universal - namely, it is 
complete. 

1.5 Remarks on the Complexity of the Potts model case 

Unfortunately, our methods are not applicable for showing that values of the graph weights and q that 
correspond to physical Potts model parameters, are BQP hard, (for any additive approximation). 

The reason is essentially this. The partition function of physical parameters is, by definition, a 
function that corresponds to some Gibbs distribution, and is thus, always positive. Our methods, however, 
translate a unitary matrix to a graph, such that, say, the first entry of the unitary matrix is equal to 
the partition function. Proving universality would mean that any unitary matrix can be presented this 
way, including unitary matrices whose first entry is negative. But there is no graph whose Potts model 
partition function would be negative, as it is a positive number! 

The exact argument is slightly more complicated, and will be given in a later version of the paper. 
The conclusion is that at least with our methods, BQP completeness seems impossible to prove. Perhaps, 
BQP completeness does not hold. 

As we mentioned, we provide weak evidence for non-triviality in a different way. We consider two 
families of weighted graphs for which the evaluation of the Potts model can be done efficiently classically. 
The two families have the same underlying graph: the line of n sites. However, we consider this graph 
with different weights on the edges. It is quite easy to show that the quantum algorithm distinguishes 
between the Partition functions in the different cases, namely, the window in which one answer might 
lie does not intersect the window in which the other answer lies. It is unclear how well the algorithm 
performs for more difficult graphs, in which no efficient classical algorithm is known. 

Proving anything about the complexity of our algorithm for the Potts model, remains a very important 
open problem. It is still possible that this case of the Tutte polynomial, with our additive approximation 
window, can be solved by an efficient classical algorithm. 

1.6 Conclusions and Open Questions 

In this work we provide new quantum algorithms for many new problems, all castcd in the framework of 
approximations of the Tutte polynomial. Our quantum algorithms are distinctly different from the well 
trodden path of quantum algorithms for the Hidden Subgroup problem. They take the approach of |6j in 
which local structures of the problem are encoded into linear operators which are then applied; however. 
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we take a significant step forward, by leaving the restriction of unitarity behind, which aUows us to apply 
our algorithms and the hardness results to many new points. 

Our methods rely on representations of a generalized version of the Temper ley Lieb algebras, which 
we define. We believe that the main achievement here is that we demonstrate how to handle non-unitary 
representations, and in particular, we are able to prove universality using non-unitary matrices. This 
might open up the way to applying non-unitary representations in completely different contexts, and 
perhaps will serve as an important advancement in the understanding of the physical Potts model case, 
which remains as the main open question. 

In our proofs of completeness, we have only given examples in which BQP completeness can be proved. 
In fact, many more points can also be shown to be BQP complete; In this work we have only taken the 
effort of showing a few examples, but the characterization of which points exactly can be proven to be 
quantum hard or complete is left for a further study. 

A far reaching hope is that progress in the direction of the hardness of the Potts model parameters, 
would lead to insights regarding open questions in the field of statistical physics, and in particular, 
understanding the conditions for rapid mixing in the Potts model, and its relation to the phase transition 
of the Potts model. 

Another very interesting question is whether it is possible to find an algorithmic way to calculate 
the optimal grouping, that would derive the best approximation window, or even some approximation of 
it. This would allow us to define the problem in a more natural way, without artificially providing the 
partition in the input. 

One more interesting question is whether these results can be generalizes to non-planar graphs. 
Last but not least, it would be extremely interesting to see our methods applied to other approximation 
algorithms of complete problems, which are not described by the Tutte polynomial. 

Organization of paper: 

In Sec. [2] we start with some background on the Tutte polynomial and its connection to the Potts 
model and to the Jones polynomial. We proceed in Sec. [3] to establish a well-known mapping of a planar 
graph to its medial graphs which is a 4-regular graph. On the latter, we define the Kauffman bracket, 
which is equivalent to the multivariate Tutte polynomial of the original graph, and which we will mainly 
use. Section sec:kauffman:tl defines our generalization of the Temperley Lieb algebras, and provides the 
way to write the Kauffman bracket in terms of a Temperley Lieb algebra scalar element. Section [5] gives 
the exact representations we will use, called the path model representations. Section [5] first states the 
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exact version of Theorem 1 together with the definition of the size of the approximation window, and 
then provides the algorithm and proves that it works. 

In the second part of the paper, we prove the resuhs concerning the hardness and completeness. 
Section [7] proves the universality proof for two cases: one is the case where the parameters are such 
that the matrices are almost unitary, namely, they are a scalar times a unitary. This already provides 
many new points in the Tutte plane, but is essentially the same proof as in the Jones case [?]• Then we 
provide the universality proof for the non-unitary cases (both complex and real). It is here that we need 
to use the novel ideas of how to apply the Solovay-Kitaev theorem despite not being in the safe zone of 
unitarity. Finally, in Sec. 1101 we define the version of the problem in which the grouping of the graph is 
also provided, and prove the completeness of this version. 

2 Background: The Tutte polynomial 
2.1 The Tutte polynomial 

The multivariate Tutte polynomial (which will often be referred here simply as the "Tutte polynomial" ) 
is defined for finite graphs with weighted edges and a scalar q. Below we give a short definition and 
description of this important polynomial. Our treatment and notation closely follow Ref [30j . which we 
strongly recommend as an introductory text for this subject. 

Given a graph G = (V, E) and a subset of edges A C E, we define k{A) to be the number of connected 
components in the subgraph (V, A). Notice that an isolated vertex is considered a connected component, 
hence when A is empty, k{A) = \V\. The multivariate Tutte polynomial is then defined by 

Definition 2.1 (The multivariate Tutte polynomial) Let G = {V,E) be a finite graph with vari- 
ables V = {ve} assigned to its edges e Cz E. Then the multivariate Tutte polynomial of G is a polynomial 
in V and an extra variable q, defined by 



When considering a particular set of values of the variables v = {ve}, it is customary to call them weights, 
and G a weighted graph. In addition, it is common to substitute the variables v = {v^} with a single 
variable v and obtain a two-variables polynomial 




(1) 



A<ZE 



eeA 




(2) 



ACE 
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As an example, consider the graph G — 
terms Uke these: 



with all edges set as Ve = v. Its expansion contains 




k{A) = 1, |A| = 5 =^ Term = qv' 



k{A) = 2,\A\=3=^ Term = q'^v^ 



k{A) = 3, |A| = 1 =^ Term = q^v 



2.2 Relation with the standard Tutte polynomial 

When the variables v = {ve} arc all equal to the variable w, it is easy to relate the resulting polynomial 
Zo{q,v) to the standard Tutte polynomial Tc{x,y) [TH pp. 45] which is defined by 



TG{x,y) = ^ (x- i; 



k{A}-k(E} 



\A\+k{A}-\V\ 



(3) 



ACE 



Indeed, a simple algebra yields 

rG(x,2/) = (x-l)-'=(^)(y-l)-|^IZG((.T-l)(y-l),y-l) . (4) 

In other words, TQ{x,y) and ZQ{q^v) are essentially equivalent under the change of variables 

X = 1 + (jf/u , y = 1 + u , (5) 
g= (x- 1) , v^y-l. (6) 

2.3 Relation with the Potts model 

In this section we elaborate on the connection between the Potts model and the Tutte polynomial. A 
broad review of this model, in a more physics-theoretical context, can be found in Ref [25j . 

The Potts model, which is also known as the g-state Potts model, is physical model that is defined on 
a graph G — for some q E TL^. We call the vertices of the graph sites. They hold physical objects 

like "atoms" or "spins", which can be in one of q possible states (colors, or spin states) {1, . . . A 
coloring of all sites is called a configuration of the system and is denoted by a map a : V —^ {I, . . . , q}. 

The energy of the system is defined by assigning each edge e £ E, a, coupling constant Jg. Then if ei 
and 62 denote the adjacent sites of e, and (T(ei), cr(e2) denote their colors respectively, the energy of the 
edge is 
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where Sa,b is the usual Kronecker deha. Therefore the energy of an edge is — Je if its sites share the same 
color, and is zero otherwise. The total energy of the configuration a is now 

eeE eeE 

In statistical physics we assume that the system constantly changes its microscopic configuration. The 
actual state of the system is therefore of little importance, and instead we try to estimate the probability 
P(f7) of the system to be in a configuration cr, and use this information to understand the global properties 
of the system. In this context, it is common to consider the scenario in which the system is attached to 
another, much bigger system (a thermal bath), with a constant temperature T, and the two systems are 
allowed to exchange energy (heat). Then the system is described by the so-called canonical ensemble. In 
this case, the probability distribution of the system is the Boltzmann-Gibbs distribution, 

Pia) ^ -J—e-P"^'^) . (9) 

Here (3 = \/{kBT) is the inverse temperature with ks being the Boltzmann constant, ^f""" is the 
normalization factor which is called the partition function of the system, and is given by 

^Po„. drf^^_;3^^(,) ^ (10) 
a 

The partition function Z'^"^^" has a central role in statistical physics. Knowing it allows us to calculate 
important global properties of the system such as its average energy, entropy, heat capacity etc. 

For a positive temperature T > (equivalently /3 > 0) and real couplings Je, the total energy H{a) 
is a real number, hence Eq. ([5]) is a valid probability distribution. In such case, configurations with low 
energies are preferable. An edge with > is called "ferromagnetic" . Its adjoint sites will prefer to 
be mono-color. On the other hand, an edge with Jg < 0, is called "anti-ferromagnetic", and its adjoint 
sites will tend to have non-identical colors. An interesting case is when all sites are anti- ferromagnetic 
and T — > (equivalently [3 — > +oo). In such case, if there are configurations of the graph where adjacent 
sites have different colors (i.e., "legal coloring"), then only those configurations will have a non- vanishing 
probability. In particular, we find 

hm^ = # of g-colorings . (11) 

Deciding whether a graph has one or more g-colorings for g > 2 is a well-known NP-complete problem. 
Hence the above example suggests that calculating, or even approximating the partition function, can 
be, at least in some cases, a highly non-trivial task. Currently, most common approximations to this 
quantity use Monte-Carlo type algorithms (See Refs. |311[35], for example). 
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If we now define 

ve - e^'^ - 1 , (12) 
then a simple algebra shows that the partition function can be written as 

V) = 5] n [1 + ^Me,)Me.)\ ■ (13) 

It is far from obvious that v), which is defined separately for each positive integer q, is in fact a 

polynomial in q, but as noted by Fortuin and Kastelyn in the late 1960's [551 [H], this is indeed the case: 



Theorem 2.1 (Fortuin-Kasteleyn representation of the Potts model) For integer q > 1, and 
for all V, 

Z--(Q,v)=ZG(g,v). (14) 

A proof of this theorem can be found in Ref. [3D]. 

Approximating the partition function of the q-state Potts model is therefore equivalent to approxi- 
mating the multivariate Tutte polynomial of the underlying graph with weights given by Eq. ()12p . The 
complexity of the two problems is identical. 

Theorem 12.11 enables us to assign a probability to every coloring of a graph when integer q > 
and We > — 1 (which is equivalent to Jg being real for real temperatures). We can actually extend this 
probabilistic interpretation for non- integer q's by using the original definition (j2.ip . The idea is to view 
the elements of sum in Eq. ([T]) as the new configuration space. In other words, a configuration is defined 
by a subset of the edges A C_ E, and is assigned a probability which is proportional to OeeA'^e- 
Theorem (|2.ip guarantees that these two definitions coincide in the appropriate region. We are thus led 
to define 

Definition 2.2 (Potts parameters) We say that v, g are Potts parameters (or physical parameters) 
when 

• q is an integer larger than 0, and for all edges Ve > —1; or 

• q > and for all edges Ve > 0- 

Notice that the conditions of the second case guarantee that q'^f"^) IleeA is real and positive. Also note 
that in the first case, Jg > (ferromagnetic case) corresponds to > 0, and Jg < (anti-ferromagnetic) 
corresponds to —l<Ve< 0. In both cases, we are assured that Zciq^v) is a real positive number. 
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Figure 1: From a planar graph to a medial graph. In the planar graphs, the weights Vi are assigned to 
the edges. These weights are then transformed to the crossing weights Ui in the medial graph, which are 
related by Vi = dui. 

2.4 Relation with the Jones polynomial 

Another interesting special case of the multivariate Tutte polynomial of planar graphs is the Jones 
polynomial. By constructing the so-called "medial graph" , which will be defined in the next section, one 
can translate a planar graph to a knot in the three dimensional space. Furthermore, for a particular 
choice of weights and there is a simple connection between the Tutte polynomial of the original graph 
and the Jones polynomial of the knot. We will elaborate on this connection in a later version of this 
paper. 

3 Background: From the Tutte polynomial to the KaufFman 
brackets 

The Tutte polynomial of a planar graph G can be recast in terms of Kauffman brackets of the medial 
graph Lq. The medial graph is constructed from the original graph in some well-defined way. This 
definition of the Tutte polynomial turns out to be more convenient for our purposes. 

We first define the medial graph. For each planar graph G we can build 4-regular graph Lq which 
is called the medial graph of G. It is obtained from G by first encircling the facets of G with lines, and 
then crossing the lines that surround each edge by putting a 4-regular vertex in its middle. 

The regions of the medial graph can be colored in a consistent way by black and white, such that the 
regions on two sides of an edge are colored by opposite colors. Its black and white coloring is uniquely 
determined by setting its outmost area to be white. It is easy to convince oneself that this coloring is 
unique and well defined. An example for such construction is given in Fig. [T] 

The vertices of the medial graph, which we also call crossings^ are assigned weights, denoted u = {we}, 
where Ue is the weight of the crossing corresponding to the edge e in G. If G is a weighted graph, then 
one can define a natural, one-to-one mapping between its weights v — {ve\ to the weights u = {u^}. The 
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nature of this mapping will be clear shortly, when we define the KaufFman bracket of a medial graph. 

To define the Kauffman brackets of Lq, we first define the notion of a state. A state u = {ce|e G E} 
is an assignment of or 1 to every crossing in G. a is understood as a prescription of how to open every 
crossing in Lq in one of two ways: ""^XT or ^ . When a^. = \ the shaded areas are connected 



(and consequently the original edge e of G is included in the resulting shaded area), while when = 
the shaded areas are disconnected (and the edge is not included in the shaded area). Once all crossings 
have been opened, the resulting diagram contains only loops. We let \a\ denote their total number. 
The Kauffman brackets of Lq is defined as the following sum 

{Lg) ^Y.(^gW) . (15) 

(T 

where (Lg|(t) is defined by 

(LG|a)=dHn<^ • (16) 

In other words, we take the product of dl'^l with the u^. of all crossings that were opened in such a way 
that their edge was left alive. We note that our definition of the Kauffman bracket is different from 
the standard definition of the Kauffman brackets, which is originated from the theory of knots. The 
main difference is that in our definition we allow a different weight for each crossing, whereas in the 
standard definition only one weight is used. It might therefore be more appropriate to call this object 
the "weighted Kauffman bracket" . However, for the sake of brevity and at the risk of a slight confusion, 
we will continue to call it simply the Kauffman bracket. A definition of the standard Kauffman brackets 
and their relation to Knots theory, the Jones polynomial, the Potts model, and many other topics in 
mathematical physics can be found in Refs [HI HH US] . 

We will now show the connection between the Kauffman bracket of Lq and the multivariate Tutte 
polynomial of G. 

Claim 3.1 For any scalar d and a set of weights u = {ue\, the following equation holds: 

{LG){d,n)=d-\''\ZG{d\du) . (17) 

Proof: The proof is an adaptation of well-known ideas due to Kauffman from the subject of the Jones 
polynomial in Knots theory |18j . Each state <t of the medial graph Lg uniquely defines a subgraph of G 
whose edges are those with CTc = 1. Denote this set of edges hy AQ E. At the same time a corresponds 
to a particular opening of the crossings. The opening is such that each facet of the subgraph (V, A) is 
encircled by exactly one loop. Therefore the number of loops, |(t|, is exactly the number of facets in 
(y, A). Now the number of facets in (V, A) is simply k{A) + c{A), where c{A) is the number of circles in 
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the subgraph. Using the identity \V\ + c{A) = k{A) + \A\ (see, e.g, [50] Sec. 2) we have 

\a\ = k{A) + c(A) = 2k{A) + \A\ - \V\ . (18) 

Hence, 

ACE eeA ACE eeA 

= d-\^\ZGid\du) . (20) 

□ 

Therefore the KaufTman bracket of Lq is proportional to the Tutte polynomial of G with q = and 
V = du. 

4 The KaufFman brackets in terms of the Generahzed Temper- 
ley Lieb algebras 

Our first goal is to design an algorithm that calculates Kauffman brackets of a given medial graph. How 
can the Kauffman bracket be evaluated? The main mathematical tool is the Temperley Lieb algebras. 
It enables us to break the medial graph into small pieces, which are then translated to local operators. 
Here, we use a generalization of the well-known Temperley Lieb algebras to an algebra of pictures with an 
arbitrary number of strands, so that we can talk about product of elements with non-matching numbers 
of in- and out-going strands. We call it the GTL{d) algebra. It is this definition that enables us to deal 
with all planar graphs, and not just graphs that are related to braids - which have a fixed number of 
strands. 

4.1 The Generalized Temperley-Lieb algebra GTL{d) 

The GTL{d) algebra is an algebra that is generated by diagrams of the form that is shown in Fig. [2J Its 
precise definition is as follows: 

Definition 4.1 (The GTL[d) algebra) The GTL{d) algebra is an infinite dimensional algebra that is 
defined for every scalar d. Its basis elements are represented classes of diagrams that are made from a 
finite number of strands that connect n lower pegs to m upper pegs. The diagrams must not contain any 
crossings and loops, but they may contain local minima/maxima. Two diagrams, belong to the same class 
if they are topologically equivalent, or if one diagram can be obtained from the other by adding a number 
of straight strands to its right. See Fig. [H We will denote algebra elements by capital calligraphic letters 
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m=5 m=5 m=7 




Figure 2: Three different diagrams that represent the same GTL{d) basis element. The first two have 3 
in-pegs and 5 out-pegs and are isotopic to each other. The third one has 5 in-pegs and 7 out-pegs and is 
a result of adding two straight strands to the right of the second diagram. 

such as T, A,B, . . ., and diagrams by Gothic letters T, 21, 03, . . .. Notice that every GTL{d) basis element 
is represented by an infinite number of diagrams, but every diagram represents only one basis element. 

Product Rule: For every two basis elements 7i,72, the product T3 ^ T2 ■ 7i is defined by the diagram 
that we obtain by placing a diagram that represents T2 on top of a diagram that represents 7i and 
matching their pegs starting from the left. // the number of lower pegs in T2 is different than the 
number of upper pegs in 7i we pad one of them with straight strands until they match. 

Identity Element: From the above definition it is easy to see that the identity element of the algebra, 
X, corresponds to diagrams with only straight strands, in which the i 'th bottom peg is connected to 
the i 'th upper peg. 

Relations: 

• Two isotopic elements in the GTL{d) algebra are egual. 

• Two elements in the GTL{d) algebra are egual if a diagram of one element can be obtained 
from a diagram of the other element padded with straight strands at its right side. 

• A single loop is equal to the scalar d times the identity T. The parameter d is a (complex) 
number which is a fixed parameter of the algebra, and is called the loop value. 

• An element in the algebra which contains (but is not isotopic to) an empty loop, is equal to the 
same element, except that the loop is removed, and the result is multiplied by the loop value d. 

We extend the GTL{d) algebra to linear combinations of basis elements by extending linearly the 
product rule. 
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4.2 Tangles and elementary tangles 

It will be convenient to talk also about tangles. These are diagrams, which are similar to the diagrams that 
represent basis elements of the GTL{d) algebra, except that they may also contain loops and weighted 
crossings. We use them to denote certain linear combinations of basis elements in GTL{d). 

Following the product rule in Definition [131 a- diagram with £ loops denotes the GTL{d) basis element 
of the corresponding diagram without the loops, times a factor of . 

We use a tangle with exactly one crossing and a complex weight attached to it, to denote a different 
linear combination of GTL{d) elements. This is done as follows: let T be such a tangle, with one crossing 
weighted with the weight u. We perform a black-and-white coloring of the tangle, starting from its 
left-most area, which is always painted in white. We then look at the two possible ways of opening 
the crossing: connecting or disconnecting the two shaded areas of the crossing. Let X be the GTL{d) 
element that corresponds to the diagram in which the shaded areas are connected, and let y e GTL{d) 
correspond to the diagram in which the shaded areas are disconnected. Then we use the diagram T to 
denote the following GTL{d) element: 

T^uX + y. (21) 

Of course, if the tangle contains more than one crossing, we can apply the above procedure on all crossings, 
recursively, thereby associating T with a linear combination of basis elements of GTL{d) and loops. It is 
easy to convince oneself that this definition is independent of the order in which we open each crossing, 
and so the mapping from tangles with weighted crossings to GTL{d) is well defined. 

Next, we define the notion of elementary tangles^ which are simple tangles that can generate all 
tangles. 

Definition 4.2 (Elementary tangles) A tangle is called an elementary tangle if it is represented by 
one of the following diagrams (up to isotopy and/or addition of straight strands at the right) 

1. Cup Ai The cup diagram Ai is the diagram which is identical to the identity 2 diagram except for 

the upper i,i + 1 pegs, which are connected to each other. The diagram thus contains a minima after 
the i — I strand. 

2. Cap Bi Similar to the cup diagram, Bi is identical to the identity diagram except for having a maxima 

which is created by connecting the lower i,i + 1 pegs. 

3. Crossing Ci{u) The crossing diagram Ci{u) contains a crossing with weight u between the i, i + 1 

strands, while all the other strands are trivial. 

An illustration of these three diagrams is given in Fig. O 
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A crossing 



Figure 3: An illustration of the elementary tangles (from left to right) ^3, S3 and C-i{u) 



Notice that the three elementary tangles are not independent. Indeed, it is easy to see that for odd i, 
Ci{u) = uX + Ai ■ Bi, whereas for even i, Ci{u) = I + uAi ■ Bi. It turns out that it is only the cup and 
cap tangles that are needed to generate the full GTL{d) algebra. Nevertheless, we will find the crossing 
operator extremely useful to describe the algorithm and the universality result. 

For a tangle that corresponds to a basis element we can define the number of in-pegs to be the number 
of lower pegs in the diagram, and the number of out-pegs to be the number of upper pegs in the diagram 
(see Fig. [5]). This definition can be extended to general tangles, since by opening a crossing we obtain 
two diagrams with the same number of in-pegs and the same number of out-pegs. We note that these 
numbers are not well defined for GTL{d) elements, but just for tangles, since the definition of the GTL{d) 
elements allows adding a number of straight strands to the right without changing the element. 



4.3 Connection between the Kauffman brackets and the GTL{d) algebra. 

Our goal is to calculate the Kauffman bracket {La){d, u) of a medial graph using the algebraic structure 
of the GTL{d) algebra. We do this by defining a map which associates a medial graph with an element 
in GTL{d). This map was essentially defined in the previous section. Indeed, we may simply consider 
an embedding of Lq in as a tangle. Then, according to the previous section, '^diLc) is a linear 
combination of basis elements of GTL{d), and hence is an element of GTL{d). 

Moreover, as noted in the previous section, tangles have a well-defined number of in-pegs and out- 
pegs. It is easy to see that 5*^(^0) has zero in-pegs and zero out-pegs, and is actually equal to a scalar 
times T. This is because as we open up its crossings, we end up with tangles that contain only loops; 
these are then proportional to T by the loop-value relation. It turns out that the coefficient in front I 
element is exactly the scalar we are looking for! 

Proposition 4.1 ^^(^0) = (iG)(d,u)X. 
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Proof: Opening all the crossings of the Lq element in accordance with Eq. (PT|) . we are left with a 
sum of 2l^l terms. Each term corresponds to a possible opening of all the crossings, i.e., to a state a of 
the Kauffman bracket. It is a diagram with only loops and must therefore be proportional to 2. The 
proportionality factor is exactly (Lda). Indeed, assume that the opened diagram contains £ loops and 
corresponds to a state a. Then the proportionality factor is times for every crossing with <Te = 1. 
This is because such opening joins the shaded areas and is therefore multiplied by a Ue factor according 
to Eq. (|2ip . Summing up all terms we obtain the desired result. □ 

This shows that if we want to approximate the Tutte polynomial, or rather, {LQ){d^u)^ it suffices to 
approximate the coefficient in front of 2 in "ildiLc). 

4.4 Connection with Representations of GTL(d) 

To approximate the above coefficient, we consider representations of the GTL{d) algebra. We will define 
the representation /or diagrams rather than for abstract GTL{d) elements, as it would depend on the 
particular in-pcgs and out-pegs of the diagrams. 

Definition 4.3 Let {Hn\ he a series of Hilbert spaces (which do not have to be different from each 
other). A map p that maps diagrams of tangles to linear operators over the Hilbert spaces is said to be 
a representation of GTL{d) if it maps every tangle diagram S with n in-pegs and m out-pegs to a linear 
operator p{%) : Hn Hm such that: 

• /5(T) only depends on the isotopy class of T 

• For diagram T with n in-pegs and m out-pegs, p{%) is a linear transformation from Hn to Hm ■ 

• Preserving the multiplicative structure: If Ti has n in-pegs and m out-pegs and %2 has m 
in-pegs and £ out-pegs then p{'^2 ■ "Ji) = p{'^2) ■ pC^i)- 

• Preserving the additive structure: If 1i,l2 represent the GTL{d) elements 71, 72 and T3 
represents the GTL{d) element ciTi + C2T2 then ^(Ta) = ci/9(Ti) + C2p(X2)- 

We will later provide a definition of such a representation of GTL{d), called the path-model represen- 
tation. Nevertheless, any representation will do to evaluate the desired coefficient: 

Claim 4.1 If p is a representation of GTL{d), then p{^d{LG)) ~ {LG){d,u)l. 

Proof: This is true because any representation will take the identity T to the identity operator 1. □ 

We can thus perform our desired approximations on the image of a representation of the GTL{d) 
algebras. 
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Remark: Wc note here an important difference between the connection that is made here between the 
representation of the GTL{d) algebra, and the Kauffman bracket, versus a similar connection that was 
used in [6]. In [6], a similar result was true for TL„{d), a restricted version of GTL{d), in which the 
number of strands was finite, and fixed. The representation however, was required to exhibit a property 
called the Markov property, for the connection to the Kauffman bracket to hold. Here, due to the fact 
that a single algebra element is associated with a class of diagrams, which have different in- and out-pegs 
numbers, no such property is needed, and any representation will do. This is an advantage of working 
with the GTL{d) algebra. 

We now proceed to describe the representations we will work with. 

5 Background: The Path Model Representation of GTL{d) 

We use the path model representations. They are essentially the representations used in Ref. [5], except 
here we allow also non-Hermitian representations, and also consider the infinite algebra rather than finite 
versions of it. In addition, in accordance with Definition l4.31 the representation that we describe is defined 
for diagrams of tangles, rather than for the tangles themselves, since it depends on the number of in-pegs 
and out-pegs of the diagram (it docs not depend, however, on the particular geometry of the diagram 
- only on its topology). Specifically, for each diagram we define a linear transformation with different 
domain and target spaces, the dimensions of which, depend on the number of in-pegs and out-pegs of the 
diagram. We begin by motivating the definition, and then move to the exact definitions. 

5.1 Diagrams as operators over paths on an auxiliary graph 

Let T G GTL{d) be a basis element, which is represented by a diagram T with m in-pegs and n out-pegs. 

We would like to associate the diagram with an operator. How can this be done? We draw T and 
confine it to a rectangular box, thereby dividing the box into a set of disjoint regions. Then the n lower- 
pegs of the diagram, which are connected to the lower edge of the box, divide it into n + 1 gaps, while 
the upper pegs of the diagram divide the upper edge of the box into jti + 1 gaps. We would like to treat 
T as an operator acting on a sequence of labels of the upper gaps, taking it to a sequence of labels of 
the lower gaps. For this, we use the notion of an auxiliary graph F, and allow the labels of the different 
regions of the boxed diagram to be vertices of F. Further more, we restrict the labeling of the diagram 
to be such that two adjacent regions are labeled by adjacent vertices in F. We observe that under those 
conditions, the sequence of labels of the upper gaps, as well as that of the lower gaps, is in fact a path in 
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Figure 4; Labeling of a diagram according to the vertices of an auxiliary graph F. 

the auxiliary graph F, and that the diagram can be seen as a transformation the takes the path of the 
lower gaps to the path of the upper gaps. As an example, see Fig. H) 

We will thus view 1 (and through it T) as an operator on paths on F. We can now define the Hilbert 
space of paths formally. 

5.2 The Hilbert Space of Paths on F 

For simplicity, denote the vertices of the auxiliary connected graph by {1,2, 3, ...,}. We define a, family 
of Hilbert spaces corresponding to paths of length n of F, as follows: 

Definition 5.1 (The spaces For any integer n > 0, the n-steps paths space Hn of an auxiliary 

graph F is a Hilbert space that is defined as follows: let p = {pi,p2, ■ ■ ■ ,Pn+i} denote an n-steps path 
on F which starts at the vertex pi and ends at the vertex Pn+i- We demand that pi ~ 1 and that pi 
is adjacent to Pi+i in F (denoted as pi ^ Pi+i)- Then we associate every path p with a vector \p) , and 
consider the set {|p)} of all n-steps paths to he an orthonormal basis for the space Hn- 
Note that Hq is a one dimensional space which is spanned by the vector 

5.3 Compatible paths 

Let X be a diagram with m in-pegs and n out-pegs that represents a basis element of GTL(d). We 
associate X with a transformation p{T) : iJ„i —>■ H„ , by defining {p'\ p{%) \p) for every n-steps path 
\p) = . . . ,Pti+i) and m-steps path \p') ~ \p'-^^ . . . The term {p'\ p{1) \p) will be zero, unless 

the two paths are compatible in the following sense: 

Definition 5.2 (compatible paths) Let % be a diagram of a basis element, or a basis element with 
closed loops, with n in-pegs and m out-pegs. Then the n-steps path p and the m-steps path p' are called 
compatible with respect to if, when labeling T 's lower gaps with the vertices of the path p, and the 
upper gaps with vertices of the pathp' , the result is a diagram where all connected surfaces have the same 
label. The labeling is done from the left to right with respect to the order of the vertices in each path. 
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Figure 5: Local minimum and maximum, associated with ak^e and b^j respectively. 

It is clear that the definition depends only on the isotopy class of the diagram. 

The upper path and lower path in Fig. Hare thus compatible. We define the value of {p'\ p{1) \p) to 
be zero if the two paths are not compatible. What about compatible paths? This is defined in the next 
section. 

5.4 The value {p'\ p(%) \p) for compatible paths 

Consider a diagram T of a basis element and two compatible paths p,p' with respect to it. As T does not 
contain any loops, it is easy to see that p,p' determine the labeling of all areas in S uniquely. To define 
(p' I p('J) \p) , we consider all the local minimums and maximums of the diagram that represents T. The 
area near each such point is labeled by two labels - adjacent vertices from F. A local minimum labeled 
by i from above and k from below is associated with a constant a£jt, and a maximum with the same 
labeling is associated with hi^k (see Figure [5] for illustration). 

We will soon assign values to ai^k and bl^k^ and discuss their properties. As an example, the minima and 
maxima of the diagram in Fig.l^are associated (from top to bottom) with the variables: 04.2, 02,1, &1.2, 0,1,2 
and 62.3- We define {p'\ p{1) \p) to be the product of all the constants that appear in the diagram. The 
definition of p can thus be summarized as follows: 

Definition 5.3 (Tiie path-model representation) Let T be a basis element with a diagram 1, and 
p,p' compatible paths with respect to T. Then p,p' determine a unique labeling of T and {p\ p{1^) \p') is 
the product over all local maximums and minimums of T of the corresponding ai^k o-nd bi^k coefficients, 
which are determined by the labeling. 

For non- compatible p,p' , {p\ p{^) \p') 0. 

5.5 Extending the definition beyond basis elements of GTL{d) 

Thus far we have only defined p{-) for diagrams of basis elements. We would like to extend the definition 
to any diagram that represents a tangle. This is easily achieved by linearity. Recall that the tangle 
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diagrams have a well-defined number of in-pegs and out-pegs, which is equivalent to say that they are a 
linear combination of basis elements diagrams with the same number of in-pegs and the same number 
of out-pegs. Consequently, the operators that represent these diagrams have the same domain and range 
spaces, and therefore we can define the representation of the tangle's diagram by linearity. 

This defines p{T) for all tangle diagrams, and satisfies the additivity requirement in Definition 14.31 
For p{-) to be a GTL{d) representation according to Definition 14.31 we still need to verify that it only 
depends on the isotopy class of the diagrams and that it preserves the multiplicative structure of the 
algebra. 

5.6 Conditions on au/-, b^/ for p to be a representation 

To show that p(-) is a representation of the GTL{d) algebra we must assert the following: firstly, that 
isotopic elements define the same transformation - after all, our definition of relied on a particular 
geometrical representation of T. Secondly, we must show that the representation preserves the product 
rule, i.e., p(T2Ti) = p{l2)p{'ii)- The following claim provides sufficient conditions for this to hold. 

Claim 5.1 The following two conditions guarantee that p(-) is a representation of GTLid): 

y e,k: be,k ■ ae^k = 1 , (22) 
y k : ^ ai^kh/ = d . (23) 

Proof: 

To show that two isotopic elements arc given the same image by p, we use the fact that two isotopic 
elements are linked by a series of moves that eliminate or create a pair: a local maximum and a local 
minimum. Accordingly it will suffice to prove invariance for a move that creates or eliminates a single 
pair. 

Let T' be a diagram that results from T by removing a single pair of minima and maxima. A pair 
of terms: bi^k, o,e.k will be missing in the expression for {p'\p{1') \p). Now, by using Eq. (|22p we have 
be,k ■ ag^k = 1 and thus p(T') = p{1). 

Next, we prove that p(-) is a homomorphism. Given two diagrams Ti, T2 that represent basis elements, 
we would like to prove that 

p(T2-Ti) = p(T2)-p(Ti) . (24) 

The diagram T2 • Ti can represent either a basis element, or contain some loops. Assume the first case, 
and let the paths p,p' be compatible with it. Then there is only one path p* on the border between the 
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two diagrams, such that p,p* are compatible with Ti and p*,p' are compatible with T2- Therefore if 
denotes the summation over all possible paths on the border between the diagrams then 

^ (p'l \p") {p"\ \p) = {p'\ pil2) \P*) (p1 \p) = {p'\ pil2'll) \P) ■ (25) 

The first equality is due to the fact that p* is the only path that is compatible with both p' and p. The 
second equality follows from the definition of {p'\ p{%2) \p*) and p(Ti) |p); they are products of the 
o-kji bk,i coefficients that correspond to the labeling induced by p,p' . Multiplying them gives the product 
of all the Ofc/, bk.e coefficients in the composite diagram of T2 • Ti. 

Assume now that T2-Ti contains one loop, which by isotopy has only one maximum and one minimum, 
and let be equal to T2 • Ti without the loop (hence dT^ = 72 • 7i for the corresponding GTL{d) tangles) . 
Then there is more than one labeling that is compatible with p,p'. Consequently, the sum 

J2{p'\pi'^2)\p"){p"\pi1i)\p) , (26) 
p" 

is equal to the summation over all possible labeling of the diagram T2 • 2^1 which are compatible with 
p,p' . All are identical except for the label of the internal region of the loop. Therefore if k is the label of 
the external region of the loop, and £ is the label of the internal region, then we may write 

^ (p'l p{l2) b") P{1i) \p) = {p'\ P{1,) \p) E ■ (27) 

p" tl~k 

By Eq. (^5]) . the RHS of the equation is equal to d {p'\ ^(13) \p), and as T1T2 = dTs for the corresponding 
GTL{d) tangles, we get 

^ (p'l p(T2) \p") {p"\ p(Ti) \p) = {p'\ p(T2 • Ti) \p) , (28) 

p" 

as required. The case of more than one loops follows easily by similar arguments. □ 
5.7 Setting ak/,bk/'- Path representations for all d 

In the previous section we proved that given an auxiliary F and a corresponding set of coefficients ak/, bk,e 
that satisfy the conditions in Definition 15.31 with respect to d, we can construct a representation of the 
GTL{d) algebra. We will now show how to find such coefficients for every complex d. 

Consider the graph in Fig. [6] which is the one-sided infinite line. Denote this graph by Foe, and 
let Moo be its adjacency matrix. Given any complex d, we define the infinite dimensional vector tt = 
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Figure 6: The auxiliary graph Faa is one-sided infinite Une graph. 
(ti"! , 7i'2 , TTs , ■ ■■), as foUows: 

^1=1, (29) 

7r2 = d, (30) 

^3 = - 1 , (31) 

i (32) 

7r„ = (i7r„_i — 7r„_2 , (33) 



It is easy to see that tt is an eigenvector of M^o with an eigenvalue d: when we apply the adjacency matrix 
Mao as an operator on tt we get (Afoo7f)i = = d = dni, and for all n > 2: (MqcTt) 
7r„_i + {d-Kn — 7r„_i) = o?7r„. 

We use 7f to define the path representation for almost all values of d: 

Definition 5.4 Let d be such that in the above definition all coordinates of tt are non vanishing. For 
such d, the Path Representation of GTL{d) using the graph F^o is defined by: 

dof ,„.s 

ai,k = \ — , (34) 

V ""fc 

, dcf /TTfe 

oe.k = \ — ■ (35) 

V ""f 

Lemma 5.1 The map given by definition ] 5. 4\ satisfies the constraints 12S\ \23 \) . and thus, it is a repre- 
sentation of GTL{d). 

Proof Equation ^ trivially holds. For Eq. 



Eae.kbkj = — = — y^\Moo]k,eTr( = — [MooTr]^ = d . (36) 



TTl- TTl- TTl- 



In the first equality we used the definition of a^jt, bk^g. In the second equality we replaced the summation 
over ^'s which are adjacent to fc by a summation over all using the fact that the adjacency matrix 
\Mao\k,i is equal to one when k £ and vanishes otherwise. Finally, in the last equality we used the fact 
that 7f is an eigenvector of A/qo with an eigenvalue d. □ 

We now need to deal with the cases in which the vector tt vanishes somewhere. To do this, we simply 
cut the graph Fao before the first location where the vector vanishes. Say that m sites remains in the 
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graph; call this graph Fm- It is easy to see that the vector given by tt cut to that point, namely the first 
m coordinates of tt . is an eigenvector of eigenvalue d of the adjacency matrix of which is simply the 
left upper most m x m block of the matrix Mao- 

The same construction of Definition 15 .41 will now work for these values of d, except that now the graph 
being used is the finite graph F^- 

5.8 Hermitian representations 

We conclude this section with a definition of an Hermitian representation of the GTL{d) algebra. These 
types of representations have few nice properties, which are particularly important for the BQP-hardness 
result. 

Definition 5.5 (An Hermitian representation of the GTL{d) algebra) A representation p of the 
GTL{d) algebra is call Hermitian if p(2li) ~ p(*Bi)^ for every i. Here 21^ and *Bi are diagrams which 
represent the elementary cup and cap tangles Ai, Bi, which were defined in Sec. \4.1\ For the above 
condition to make sense, we require that the number of out-pegs of 21^ be equal to the number of in-pegs 



For the family of path representations we have the following corollary: 

Corollary 5.1 A necessary and sufficient condition for a path representation to be Hermitian is that the 
coordinates of the eigenvector n are all positive. 

Proof: It is a simple exercise to verify that a path representation is Hermitian if and only if a^ ^ = i'k i 
for every £ ^ k. Therefore by Eqs. (|34I35|) . we get 



which is equivalent to the condition that ni/nk is positive for every £ k. Assuming that the auxiliary 
graph is connected and that we have normalized tt such that tti = 1 , we conclude that the representation 
is Hermitian if and only if ff > 0. □ 

We finally remark, without proof, that for the general path-representation, which uses F^c , a sufficient 
condition for Hermiticity is that d = 2cos{t: /k) for integer fc, or that d > 2. These are the representations 
that are mostly used in the literature. However, as we already noted, there is no need for the representation 
to be Hermitian for our quantum algorithm to work, but see Sec. [7] for the implication of non-Hermiticity 
on the question of universality. 

This conclude the section on the Path-model representation. Throughout this and the previous 
section, we paid a special attention to the distinction between GTL{d) elements and the diagrams that 



0f^^. 




(37) 
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Figure 7: A nicely embedded planar graph G is a graph which is attached with a particular nicely 
embedded medial graph. 

represent them, since the path-model representation is only defined for the diagrams - not for the elements 
themselves. From here onwards, however, we will not make that distinction when it is clear from the 
context that wc already picked a particular diagram to represent a GTL{d) element. We will therefore 
occasionally talk about p{T) and about the in-pegs and out-pegs of T, even though these concepts are 
not- well defined. 

6 The quantum algorithm 
6.1 Statement of Result 

To state the result, we assume that the graph G is given to us not as an adjacency matrix, but rather, it 
is embedded in the plane in some generic way. 

Definition 6.1 Nicely Embedded A medial graph Lq of a planar graph G is said to be nicely embedded 
if it is given as a diagram in R^, in such a way that if we sweep a horizontal line from the bottom of the 
tangle to the top, the horizontal line meets only one elementary tangle at a time - The minimum of a 
cup, the maximum of a cap, or a crossing. 

Like all isotopic facts, it is easy to see but probably much harder to prove, that any medial graph 
can be given such an embedding. Moreover, any graph can be given as a diagram in such that the 
resulting medial graph will be nicely embedded. Therefore we abuse language and say in this case that a 
graph G is nicely embedded when wc implicitly refer to a particular nice embedding of Lq in IR^. From 
now on we assume that the graph G is given to us in such a nicely embedded way. Obviously, such a nice 
embedding induces an order on the elementary tangles of the medial graph. A simple example is given 
in Fig. [3 
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The algorithm works by assigning to each of the elementary tangles (a crossing, a cap or a cup) a 
linear operator, according to the representation p, and applying them in the above mentioned order. 

Definition 6.2 (Prom a medial graph to elementary tangles) Given a nicely- embedded medial graph 
Lq, translate it to a product of elementary tangles by decomposing it into a product of cups, caps and 
crossings according to the order that is determined by its ( nice ) embedding in . 

The scale in which the algorithm will work is defined by 

Definition 6.3 (Algorithmic Scale) Let G = {V, E) be a nicely embedded planar graph with weights 
V = {wg} and q a complex number. In addition, let p(-) be the path representation of GTL{d) with d"^ = q. 
Then its medial graph, Lq can be translated into a product of basic tangles Ti ■ . . . ■ T/v • The scale of the 
algorithmic problem of approximating ZG{q,'v) Aaig, is then 

A,,/=^^-i^i/2||p(ri)||-...-|Kr^)|| . (38) 

In other words, Aaig is the product of the norms of the operators that correspond to the elementary 
tangles that make up Lq, times an overall factor q^'^l/^, which connects {LG){d,u) to ZQ{q,w) (see 
Claiming]). 

In accordance with the last paragraph of Sec. [51 when we talk about p{Ti) we actually talk about 
the path-model representation of a diagram that represents the tangle %. That diagram, however, is 
implicitly defined by the medial graph Lq, hence p{Ti) is well-defined. 

We note that Aq;^, depends not only on the graph but also on its embedding. A different way to 
orient the graph, for example, might lead to significant changes in the scale. We do not know of any 
algorithm to optimize the way to embed the graph so as to get the best (smallest) possible scale. We can 
now give the precise version of Theorem 11.11 

Theorem 6.1 Quantum algorithm: 

Given a finite graph G ~ {y,E), with a nice embedding in R^, with weights on the edges, v, and a 
complex number q, there is an efficient quantum algorithm that approximates the Tutte polynomial of the 
graph at those weights and q, to within an additive approximation /S.aiglpoly{\E\). 

After we prove this theorem, it will become clear that we have been quite wasteful in our quality of 
approximation, and that the size of the approximation scale can be made significantly smaller if one is 
dealing with non-unitary parameters. We discuss this matter in Subsection 16.31 We rigorously address 
this issue only towards the end of the paper, in Section [TUl 
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6.2 Proof of the Algorithm 

To prove Theorem 16. 11 we essentially apply the sequence of linear operators defined before, by a quantum 
computer. It will be convenient to use the following term: 

Definition 6.4 (Operator circuit) (see Ref. f^) An operator circuit is defined just like a quantum 
circuit, except the gates are only restricted to be linear operators from k to £ quantum registers, with no 
unitarity restriction. 

Consider then a medial graph with a nice embedding. There is an operator circuit associated with 
Lg, which is simply the circuit one gets by replacing each of the elementary tangles (cap, cup or crossing) 
with the corresponding linear operator by Definition 16.21 We denote the resulting operator circuit by Q. 
Note that this operator circuit acts on the Hilbert space Hq and takes it to Hq- We claim: 

Claim 6.1 {LG){d,u) = (1|Q|1). 

Proof: By definition, Q = p(^^{Lg)^ . But by Proposition UTTl we have that ^'(Lg) = {LG)id,u)J, and 
since p is a representation, p{T) = 1. Therefore, = (LG){d,u) (1| 1 11) = {LG){d,u). □ 

In order to prove Theorem 16.11 we need to show that we can approximate the value of (1|Q|1) to 
within and additive approximation of the scale Aaig. To do this, we simply create the state |1) e Hq and 
apply the operators in Q on this state one by one. To approximate the inner product of the resulting 
vector with |1), we use the Hadamard test [S]. 

6.2.1 Moving to qubits 

We first note that we need not consider sites in F with indices that are bigger than the number of edges 
in G = (V, E), since we start from 1 and each crossing in Lg can only increases the path by one more site. 
This means that we can encode the name of a vertex in F with logarithmically many qubits (in \E\). We 
view the Hilbert space now as composed of registers; each register contains logarithmically many qubits, 
and can hold a label of one vertex in in the range 1 —i- \E\. 

Notice that each elementary crossing acts locally only on three labels of a path. Hence its operator 
under the path representation is actually the tensor product of the identity with an operator that acts 
on three local registers. The other elementary tangles, namely the cup and the cap, also yield local 
3-registers operators, as we will see in the end of the next subsection. Therefore all the operators that 
we shall apply act non-trivially only on a logarithmic number of local qubits. 
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6.2.2 Simulating a linear operator. 

Claim 6.2 Given a linear operator M : Hk —>■ Hk that acts on a constant number of local registers and a 
quantum computer, it is possible to efficiently transform any normalized vector \a) G to a normalized 
vector 1/3) S Hk ® IB, with IB being the space of an auxiliary qubit, such that the following holds: when 
the auxiliary qubit is projected to \Q) , the resulting state on Hk becomes jjjY^M\a), where \\M\\ is the 
operator norm of M . In other words, |/3) ~ M \a) ® |0) + c I7) (g) with I7) being some residual 
state and c a constant such that the overall state has unit norm. 

From this claim it follows that wc can apply Q cfBcicntly, up to some normalization factors. Wc will 
first prove the claim, and then worry about the normalization factors. 
Proof of Claim O 

According to the discussion in the previous section, if M acts on on a constant number of local registers, 
then it can be viewed as an operator that acts on ©(logdSj)) local qubits. By the polar decomposition 
lemma, we can write M as a product of two matrices, one unitary and the other is positive definite, 
M = PU . Both matrices will still act non-trivially only on O(log(|i?|)) local qubits. 

The simulation of unitary operators that act on logarithmically many qubits is a standard procedure 
in quantum computation, and can be done in polynomial time. Hence we can simulate U efhciently. 

We turn to the simulation of the positive definite matrix P. P can be diagonalized by an orthonormal 
basis, and so we assume without loss of generality that P is diagonal in the computational basis, otherwise 
we can always change basis back and forth efhciently. Let ri > r2 > . . . > > be the eigenvalues of 
P, with the corresponding eigenvectors |1) , |2) , . . . , |m). We wish to apply P/ri. To do this we set the 
auxiliary qubit to |0), and apply a unitary transformation P^ that satisfies 

Pu{ N) |0) ) = - \i) ® |0) + - {n/r^Y \i) ® |1) . (39) 

We think of the auxiliary qubit as an indicator to the validity of the original transformation; up to 
the overall factor ri, we get the original transformation only if the auxiliary qubit is |0). This is exactly 
achieved by projecting the auxiliary qubit on |0). Therefore, the application of P is recovered by applying 
Pii, projecting the auxiliary qubit on |0), and multiplying the final result by ri. Finally, note that as ri 
is the maximal eigenvalue of P, it follows that = ri. □ 

We conclude that we can apply any desired operator, provided we book- keep its norm: we actually 
apply a matrix with a unit norm. Moreover, for each operator we get an extra qubit. This qubit must 
be projected to |0) for the rest of the state to hold the desired result. 

To complete the algorithm we need to deal with the operators that correspond to the cup and cap 
tangles, which are not square operators. Their action, however, is local. They can be viewed as operators 
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that transform one register to 3 registers (the cup operator), or vice versa (the cap operator), while 
leaving the rest of the registers intact. 

The cup operator is simply the concatenation of the operator |fc) > |fc, 1, 1), with another 3x3 linear 
operator: \k, 1, 1) i-^ J^e-e^^k 1^' ^' ^^"^ '''^^^ '^^ ^^"^ vectors go to (the scalar) 0. We can apply the 
first operator easily by allocating two more registers. The second operator can be applied by the previous 
lemma. Likewise, the cap operator is defined by \k, m) i— > Sk^mbk,i \k, 1, 1), which can be applied by the 
lemma, after which we simply discard the last two registers. 

6.2.3 Completing the proof of the Algorithm with the Correct Approximation window size 

Once we have applied all operators, we need to compute the inner product of the final state, with the 
state |1) tensor with |0) on all the ancillary qubits. The final state is the following normalized state: 

^(Q|f))®|0)®|0)...|0)+c|7) , (40) 

alg 

where I7) is some residual state in which one or more of the ancillary qubits are different from |0), and 
^aig ^ rf'^'^^^a/g, 1-6., it is simply thc product of all the norms of the elementary tangles that make up 
Q. 

It is a well known fact that a quantum computer can efficiently estimate the inner product of such a 
state with (1| (g) (0| . . . (X) (0| to get an approximation of (I| Q |I), using the Hadamard test. Repeating 
this test poly{\E\) times would result in an approximation of — (1| Q |1) to within an additive window 

alg 

of size l/poly{\E\) with exponentially good confidence. Finally, we use the recall from Eq. (fT7|) and 
Claim EII] that 

{l\Q\l) = -^{LG){d,u) = -^Zciq^yr) . (41) 

^alg ^alg ^alg 

This completes the proof of Theorem 16.11 

6.3 Improving the approximation window 

When the crossings operators arc all unitary, thc scale Aaig does not increase by applying them; the norm 
of a product of unitary operators is equal to the product of their norms - which is equal to 1. However, 
the quality of the approximation might worsen severely when we apply non-unitary operations. It is 
easy to be convinced that thc approximation scale we define, and in fact, our algorithm itself, are quite 
wasteful. For example, if one combines several operators together, thc product of their norms is very 
likely to be larger than the norm of their products, and we will lose a lot in our approximation scale. As 
long as thc product of thc operators is still an operator on polylogarithmically many qubits. there is no 
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problem in calculating the product of them all as one matrix (this calculation is done classically on the 
side), and then perform it as a whole by the quantum computer, in the same method as in the previous 
subsection. This changes the contribution in the approximation window from the product of the norms 
to the norm of the product. 

The only problem is that we do not know of a way to find the optimal grouping that will do the 
job. Any grouping would improve the approximation window, however. And, it is often the case, that a 
classical algorithm that is given the graph can easily find an excellent grouping. 

In Section [TO] we will redefine the problem such that the input in fact includes the description of a 
suggested grouping, in which case the approximation window will be much better, namely the product 
of the grouped operators. 

7 BQP Hardness 

Having the algorithm at hand, we would now like to show the hardness of the problem it solves. In this 
section we will prove a precise version of Theorem 11.21 

Without loss of generality, BQP-hardness can be described as the following problem. We are given a 
description of a quantum circuit over n qubits as a product of = poly{n) 2-qubits gates 

U^Un-...-Ui, (42) 

which are local operators (i.e., they act on two neighboring qubits), taken from a well-defined basis. Then 
jO*^") is the tensor product of n |0) qubits, and we are promised that either | (O*^"] U |0^") P < 1/3 or 
I (O*^"! U\0'^") p > 2/3. We are asked to decide, within poly{n) time, which of these possibilities holds. 
We will show that under certain conditions, there exists an efficient algorithm that takes the description 
of the quantum circuit and outputs a weighted graph G such that ZG{q,-v) (times a known constant) 
approximates | (O**"! [/|0®") p. It will follow that certain additive approximations of ^^(g, v), for var- 
ious choices of the variables, are BQP hard. Before we show that, however, we need some preliminary 
definitions. 

7.1 Statement of the Result 

We wish to show that approximating the Tutte polynomial of a given planar graph, with a given set of 
weights, to within some additive approximation scale, is BQP hard. We will be able to show this only for 
weights taken from some particular sets, which will be defined later. On the other hand, we also restrict 
the type of graphs we take as input to the BQP-hard problem (restricting the type of graphs only makes 
the hardness result stronger): 
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Definition 7.1 (Input graphs) A planar graph G ~ {E,V) with a nice embedding in R^. Its medial 
graph Lq is required to have the following structure: it is a concatenation of 2n cups which are placed 
in a line one next to the other, followed by poly{n) crossings between the resulting An strands, which are 
then closed by a line of 2n caps. An illustration of this kind of medial graph is given in Fig. O 

The weights of such graphs need to satisfy certain requirements too. 

Definition 7.2 (Odd and Even Weights) The set of weights from which the edge weights of the input 
graph are taken from is denoted by W . It is required to be the union of two types of weights: odd and even 
weights (which need not be non-intersecting) : Wodd o.nd Weven- We demand that an edge that corresponds 
in the medial graph to a crossing i ^ (i + 1) with odd (even) i, is given a weight from Wodd (Weven)- 
The subset of these edges is denoted by Eodd (Eeven)- Obviously Eodd^ E^ven = and Eodd^ E^ven — E. 

We will restrict W to be one of three sets: The unitary set, the complex non-unitary set, and the real 
non- unitary set. Wc will wait with the exact definitions of these sets of parameters and provide them 
when the definition is motivated. Meanwhile, we will use the names of these sets abstractly. 

The computational problem is defined by: 

Input: An input graph G = {V^E) as in Definition 17.11 with edges weights from a set W as in Defi- 
nition 17. 2[ where W is either the unitary, the complex non-unitary or the real non-unitary set of 
parameters. 

Scale for BQP hardness: 

A,,./=^^I^H^-I ( n ^'e) . (43) 

Output: Approximate ZG{q,v) to within additive approximation window of size poiy\\v\) ^hard- 

Note that the scale here does not seem to be the same scale as that the algorithm works with. Indeed, 
whereas the algorithmic scale corresponds to the product of the norms of the operators, the hardness 
scale corresponds to, roughly, the product of their determinants, as we will see later. Wc will address 
this mismatch in the scales only in Section (TUl Wc will show in Theorem II . II that in fact, in the unitary 
case the two scales are equal, which follows from the fact that for a unitary matrix, both the norm and 
the determinant arc equal to 1. In the non- unitary case we will somewhat modify the problem to make 
the algorithmic scale smaller, so as to match the hardness scale in (j43|) . 

The following theorem is the precise version of Theorem 11.21 and is the second main result of the 
paper: 
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Figure 8: The type of medial graph that is used in the definition of the BQP-hard problem. Note that 
the lower 2n cups need not be exactly on the same horizontal line so that Lq is nicely embedded. The 
same goes for the 2n caps. 

Theorem 7.1 Solving the above problem, for W which is either the unitary set, the complex non-unitary 
set or the real non-unitary set, to within an approximation window of size po;y(|y|) '^hard, is BQP hard. 

The proof of Theorem 17. II is essentially given in the rest of this section, and in Sections [Sj [H For sake 
of simplicity we first present its outline. 
Outline of the proof: 

To prove the above theorem we will encode the n-qubits strings and the 2-qubits operators from Eq. (|42p 
inside the Hilbert space of the path representation. We will then find GTL{d) elements whose image under 
the path-representation approximates the 2-qubits gates. This is done by proving that using the weights 
in W we can construct a set of operators that generate dense group in the space of the 2-qubits gates. 
Consequently we will be able to use the famous Solovcy-Kitaev algorithm to efficiently find elements 
T G GTL{d) whose p{T) approximates the different gates. From the concatenation of all these GTL{d) 
elements we will be able to find the desired graph G = {V, E). 

The GTL{d) operators that we consider can be either unitary or non- unitary. Although the general 
structure of the proof is the same in both cases, the details vary. Hence, we will give a single unified 
proof whenever is possible, and split the discussion only at the level of the finest details. In particular, 
we will show that in the unitary case, the operators form a dense subgroup of S'[/(14), whereas in the 
non-unitary cases, it is a subgroup of SL{14,Ii) or SL{14,<C). In addition, the famous Solovey-Kitaev 
theorem, which is used in the unitary case, will be generalized and applied to the non-unitary cases. 
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7.2 Encoding a quantum gate by Crossing operators 

7.2.1 The 4-steps encoding 

Our goal is to encode the n-qubit Hilbcrt space within the Hilbert space Hk of k steps (Sec. [S]). We use 
the hnear graph Foo that was used in Sec. 15.71 and choose q (and hence d) such that non of the first 3 
coordinates of tt will vanish. This ensures us that we can use paths over at least three different vertices 
in Foo. It is easy to deduce from Eqs. (|29II33|) that any q ^ 0,1 satisfies this requirement. 

The encoding is done using the so-called 4-steps encoding, which was independently discovered by 
Kitaev [T3], and Wocjan and Yard [TS]. In this encoding, every qubit is represented by a 4 steps path: 

|0) 112121) e Hi , (44) 

ID 112321) G Hi . (45) 

A tensor product of qubits is naturally translated to a concatenation of paths, hence a string of ii qubits 
is encoded in a 4n steps path, and the space of n-qubits is encoded in H^n. Obviously Aim. H in > 2". 
The 2" subspacc of Hin that corresponds to encoded n-qubits is called the legitimate subspace Lin, and 
therefore we may write Hin = Lin ® Lj^n- 

We denote by || • H^^^ the norm over Lin. This norm can be defined for every vector in Hin by 
first projecting it onto Lin and calculating its norm there. This naturally defines an operator norm in 
Lin, which will be used later. Notice that the Lin norm is a semi- norm, since non-zero vectors with no 
projection on Lin will have a vanishing norm. 

With the 4-steps encoding we can write every unitary operator U over n qubits as an unitary operator 
over Hin- Indeed, if U is given by 

U = Y. 01 ' (46) 

id 

(here \i) is a short-hand for a n-qubits tensor) then its encoded version is 

U = J2 U^, Ii) ill + 11- • (47) 

Here l^^j- denotes an operator that is identity on Lin and zero on Lin, and therefore U_ is non-trivial 
only on the legitimate space Lin- It is easy to verify that the encoding preserves the product structure 
of the quantum circuit, i.e., U_ = C/^ • . . . • U_i. 

7.2.2 The crossing operators 

Roughly speaking, our goal is to find an element % G GTL{d) for every encoded gate C/j, such that 
p(7i) ~ C/j over Lin, where p{Ti) is the image of % under the path representation. Notice that % must 
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Figure 9: An illustration of the GTL{d) element £^2- 

have k in-pegs and k out-pegs for some k < An, in order for p{7i) to be an operator from H^n into itself. 

The following GTL{d) elements and their images under the path representation are the building blocks 
of 7-: 

Definition 7.3 (The £^ e GTL{d) elements and the operators) The £i G GTL{d) is the prod- 
uct £i AiBi (see Sec \4--1^ - In other words, it is a basis element with i + 1 in-pegs and i + 1 out pegs, 
that is defined by i — 1 straight strands, followed by a cap-cup couple in the i,i + 1 strands. An example 
of £2 is shown in Fig. O 

Under the path representation it is be mapped to an operator $i '= p{£i), from Hf^ to itself, for every 
k>i + l. 

The £i elements are often considered as the standard generators of the Temperley-Lieb algebra TLn (d) 
(see Ref and §12.4 in [1^). It is easy to see that £f ~ d£i and consequently (f>| ~ d^i. Moreover, for 
Hermitian representation p{Bi) = p{AiY (see Sec. I5.8p and consequently $i = p{AiBi) = p{Ai)p{AiY is 
Hermitian. 

Using the operators $i we can now define the crossing operators <Ti{u): 

Definition 7.4 (The crossing operators (T,;(w)) The crossing operators are the image, under the path 
representation, of a GTL{d) elementary tangle Ci{u), which contains a crossing between the i ^ (i + 1) 
strands while being trivial on the rest (see Sec. \4-l^ . They are denoted by ai{u) : Hk H^, with u 

dcf 



being the weight that is associated with the crossing: CTiiu) = p\Ci{u)y There are two types of crossing 
operators: crossing operators with odd i and crossing operators with even i. Using the black-and-white 
coloring convention, it is easy to write them in terms of $i .• 

{wl + <f>i , for odd i 's , 
(48) 
1 + u^i , for even i 's 

7.2.3 Encoding a 2-qubit Gate: how to start? 

The main idea in the proof is to show that we can approximate every encoded two qubit gate over 
the space of legitimate 8 steps paths, using the 7 operators a'i{u), i = 1, . . . ,7 with a prescribed set of 
weights W , provided that W satisfies the required restrictions, which we are still to give. 
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Figure 10: The diagrams of 14 possible vectors \pi) that span the space K — i/g.i^i 



Note that the space is not invariant under these seven operators. Hence, we wiU consider a larger 
space, the subspace K = iJg.i^i, spanned by the 8-steps paths space iJg that start and end at the first 
site of the auxiliary graph. Notice that ig C -ft' C i/g. In Fig. [TO] we have listed the 14 relevant paths 
that span K schematically. 

The idea is therefore to approximate any unitary matrix on ig, by approximating its extension to 
K, which applies the identity on the remaining 10 paths. Clearly, if ||r— T^Uffgi^i — ^ then also 
\\T — V\\Lg < S, as required. In order to do this, we show that the subgroup generated by the seven 
operators is dense in SU{K) - the group of unitary transformations over K. For that matter we will have 
to restrict the parameters we will deal with in various ways. The treatment will be completely different 
depending on the type of operators that the parameters correspond to; we discuss the different types of 
parameters next. 
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7.3 Parameters 

7.3.1 Sets of parameters Closed to Inverses 

The reason for restricting the parameters set W is so that we are able to prove the density, in some 
relevant group, of the subgroup generated by the crossing operators. More specifically, every Ui in the 
quantum circuit (|42p is a 2-qubits gate, and its encoded gate C/j works on the space Lg, of legitimate 
8-stcps paths. We wish to approximate, using the crossing operators, either the unitary or the orthogonal 
group over using the 7 operators (Ti(it), i ~ 1, . . . , 7 with a prescribed set of weights. 

Our first restriction on the set guarantees that we can treat the set generated by these operators as 
a group, by requiring that the set is closed to inverses: 

Definition 7.5 (A set of weights closed to inverses with respect to q) For a given q, a set of 
weights W — Wodd U Weven is said to be closed to inverses with respect to q, if Wodd and Weven are 
non-empty and satisfy the following requirements: 

• For every odd weight v G Wodd there must be another odd weight w S Wodd such that w + v + q = 0. 
w is then called the odd inverse weight of v, and vice versa. 

• For every even weight v e Weven there must be another even weight w € Weven such that w+v+wv — 
0. w is then called the even inverse weight of v, and vice versa. 

The following claim justifies why we call these weights inverses of each other. 

Claim 7.1 Consider two odd weights v,w €z Wodd such that w + v + q — 0. Then the (odd) crossing 
operators a i{v / d) andai{w/d) satisfy 

vw 

a,{v/d)a,{w/d) = — 1 . (49) 

q 

Similarly, for two even weights v,w € Weven such that v + w + wv = 0, we have 

ai{v/d)a,{w/d) = 1 . (50) 

Proof: We use Eq. (05]) and the identity $f = □ 

We note that there is a slight abuse of language here; in the odd case, the operators are not really 
inverses but there is an overall factor wv/q in front of the identity, which means that ai{w/d) is only 
proportional to the inverse of ai{v/d). As we shall see, this is not a significant problem. 
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7.3.2 Types of parameters 

Essentially, the parameters we can handle are divided to three different types, according to the three 
different types of operators they correspond to. 

Definition 7.6 (Parameters of unitary type) A set of parameters, {q, W) is said to be of unitary 
type if for d ~ y^q, the path-model representation is Hermitian (i.e., d = 2cos7r/fc for an integer k > 
or d > 2, according to Definition \5.5\) 
we have and additionally, 

\v + q\ = \v\ , for all V S Wodd , (51) 
+ = 1 , for allv E W^ven ■ (52) 

Claim 7.2 Fix a set of parameters which is of unitary type. Then when restricting attention to the 
subspace K, for any even i and v £ Weven "we have that Uiiv/d) is unitary, and for every odd i and every 
V G Wodd we have that ai{v/d) is equal to a unitary times an overall factor u = v/d. 

Proof: For the values of d for which the path representation is Hermitian (see Sec. 15. 8p . the unitarity is 
easy to prove. In this case, the $i's operators are Hermitian (see discussion below Definition 17. 3|) . and 
can diagonalized by an orthonormal basis. Therefore, the crossing operators, being a linear combination 
of <f>i and 1 are diagonalized by the same basis. Moreover, as $f — d^i, its eigenvalues are either d or 
0. Therefore the eigenvalues of the odd crossings operator are {v/d, v/d + d}, which by Eq. ([5T|) are of 
the same magnitude - |f/rf|. It follows that in the odd case ai{v/d) equals a unitary operators time an 
overall factor v/d. 

Similarly, in the even case the eigenvalues of the crossing operators are {1, 1 + 17}, and the restrictions 
in Eq. (|52p guarantee that both have a unit magnitude. 
□ 

Definition 7.7 (A set of parameters of complex non-unitary type) A set of parameters {q, W) 
is said to be of complex non-unitary type if none of its weights satisfies the conditions of Definition \ 7. 6] 
and at least one of the following happens: there is at least one weight which is not real, or q is not real. 

Definition 7.8 (A set of parameters of real non-unitary type) is defined similarly to the above, 
except all weights, and q, must be real. 

In the following section we further restrict these three types of parameters to get three families of 
parameters, one of each type. 
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7.3.3 Three families of parameters 

The core of the universahty proof is to show that the generators defined above, for three famihes of 
parameters, generate a dense subgroup of the unitary group on the 14 dimensional subspace K (or 
the orthogonal group in the real non-unitary set of parameters.) To show density, we have to restrict 
our parameters even further. The exact definition only requires that W would contain four weights: 
vi G Woddi V2 £ Weveri and their inverse weights. The definition can be elegantly casted by using the 
parameters a and (5: 

a = ^l + qv^^ , 13 = . (53) 

Indeed, we first notice that in this notation, \a\ = |/3| = 1 is equivalent to the conditions of unitarity in 
Definition [7161 



Definition 7.9 [The unitary set of parameters (case I)] 

We define the sets of parameters {q^W) which are contained in our unitary family. First, W is closed 
to inverses with respect to q, and (q,W) satisfy the requirements of Definition \7. 6\ In addition we require 
that q 7^ 0, 1, 2, '^^^ , and that there exist vi S Wodd, V2 € Weven such that the following holds: 

• c? , 0^ 7^ \, and 

• At least one of the two real numbers si, S2 G [0, 1) which are defined by 
is not a rational number p/r with gcd(p, r) = 1 and r = 1,2,3,4,5. 



(54) 



Definition 7.10 (The complex non-unitary case (case II)) |a|, |/3| ^ 1 and at least one of the fol- 
lowing inequalities holds: 



1 

a 

a 

„ 1 



g-1 



/3 



q- 



1 

a 

a 

1 



P- 



< \ , or 



< 1 



(55) 
(56) 



In addition to that, at least one of the numbers q, vi , V2 is not a pure real. 



Definition 7.11 (The real non-unitary case (case (3))) 5*07716 conditions on a, (3 as in the previous 
case, but now vi,V2,q are real numbers with q > 4cos^7r/5. 



44 



7.4 Density and Efficiency on the subspace K 

In this section wc state and prove the central theorem on which universaUty resuh builds. This is the 
density and efficiency theorem on the subspace K , Thcorcm l7.7l Essentially it assures us that any unitary 
gate on K can be efficiently approximated by the crossing operators to within any accuracy. To prove 
it, we first prove density, and then use the Solovey-Kitaev theorem for efficiency. The density theorems, 
however, arc given in terms of the normalized versions of the crossing operators that wc define below. 

Definition 7.12 (The normalized crossing operators) We define the normalized operators (Ji to he 
equal to ai, up to a constant, determined by the requirement that det{(Ti) ~ 1, where the determinant is 
calculated only in the subspace K . In other words, ai ai/ [det(CTi)]"'^^"'^. 

7.4.1 Density Theorems 

Using the normalized generators for the above three families of parameters, we can prove density in the 
relevant groups: 

Theorem 7.2 (Density on K, the unitary case) For {q,W) a unitary set of parameters (case I), 
the normalized operators {cri{v/d), ■ ■ ■ ,ct'j{v / dy\^^w qenerate a dense suhqroup of SU{K). 

Theorem 7.3 (Density on K, the complex non-unitary case) For {q, W) a complex non-unitary 
set of parameters (case II), the normalized operators {ai{v/d), ■ ■ ■ , a'/{v/d)}v^w generate a dense sub- 
group of SL{K, (D) . 

Theorem 7.4 (Density on K, the real non-unitary case) For {q,W) a real non-unitary set of pa- 
rameters (case III), the normalized operators {ai{v/d), - ■ ■ ,a7{v/d)}y^iY generate a dense subgroup of 
SL{K, R). 

The proof of these density theorems is non-trivial, and is deferred to the next two sections of the 
paper. 

7.4.2 Prom Density to Efficiency 

We proceed assuming these density results. The idea is that if we have density, then any quantum 
gate can be written as a product of a sequence of crossing operators. Thus, the quantum circuit can 
be approximated by a sequence of crossing operators. For this, we should of course claim that the 
approximation is efficient, and can be found efficiently. The proof is essentially a simple application of 
the famous Solovay-Kitaev theorem, where we also have to account for the normalization factors correctly. 
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which requires using a simple trick with commutators. In addition, since the Solovay-Kitaev theorem 
appHes only for SU{K), we need to generalize it to our purposes. 

Let us first recall the Solovay-Kitaev theorem in the version which will be most useful to us: 

Theorem 7.5 (The Solovay-Kitaev theorem) Let M he a Hilhert space with dim A/ > 2 over (D. 
Then there exists an eq > which only depends on dim A/ such that if Q is an eQ-net over the special 
unitary group SU{M), then for every transformation V G SU{M) and every S > there exists a sequence 
S of polyilogS^^) elements from Q, which can he found in poly (log S^^) time, such that \\V — S\\ < S. 

To deduce efficiency from density in the non-unitary case, we can no longer apply the Solovay kitaev 
theorem, which holds for SU{K). To this end, we will prove a generalization of the Solovey-Kitaev 
theorem that holds for density in S'i(A', (D), in which the generators of the eg-net may be close - but 
not necessarily inside SU{K). For the S'i(A', R) case we will prove yet another generalization of the 
algorithm which works on the special-orthogonal groups SO{K) instead of SU{K). The modifications of 
the SK theorem are not difficult technically, and so their proof is deferred to the Appendix. 

Theorem 7.6 (The non-unitary Solovay-Kitaev theorem) Let M he a Hilhert space with dimAf 
> 2 over the field F, which can either he (D or ]R, let R > he some finite radius, and let Q = {gi, . . . , g^} 
he a finite set of generators in SL{M,F). Define Be.{M) to he the set of all transformation in SL{M,F) 
whose distance from the unity transformation is smaller than R. Then there exists an eo > which only 
depends on dimAf and on R, such that if Q forms an e^-net over Bii{M), then for every transformation 
V e Bii{M) and every S > there exists a finite sequence S of poly{\ogS^^) generators from Q, which 
can he found in poly{logS^^) time, such that \\V — S\\ < S. 

Proof: The proof is given in Appendix |B] □ 

We now use the Solovay-Kitaev Theorem and its non-unitary variants to prove that indeed, the 
generators can be used to approximate any given gate efficiently, with the appropriate overall factor. In 
addition to the Solovay Kitaev theorems and the density theorems, we also need to take a correct account 
of the overall factors. 

Theorem 7.7 (The density and efficiency theorem) Let q = and W helong to one of the allowed 
families. Then if W is unitary or complex non-unitary, then there exists a classical algorithm that takes 
a 2-quhits operator V G SU{A) and a numher e > 0, and outputs the description of a tangle T £ GTL{d) 
with 8 in-pegs and 8 out-pegs such that 

||i:-p(r)/AT|L, <e, (57) 
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in poly{\oge^^) time. T is equal to a product of poly{loge^^) elementary crossings with weights taken 
from W/d, and A7- is an overall factor, given by the product of all the odd weights v/d that appear in T . 
If W is real non-unitary then there is a similar algorithm for any operator V G 5*0(4). 

Proof: Let us start by proving the density and efficiency theorem for the unitary case. We would hke 
to use the density theorem to create an eg net that would later be used by the Solovey-Kitaev theorem. 
The problem is that the density theorem uses the normalized crossing operators, whereas we want the 
density and efficiency theorem to be written in terms of the original crossing operators. The solution is 
to approximate the eo-net with products in which for every normalized crossing operator there appears 
its inverse. This would either cancel the normalizing factors, or would make it easy to book-keep them. 
We achieve that using the following trick: let eg be as in Theorem 17.51 and consider an arbitrary eo/2-net 
in SU{K). Each element in the net can be written as a finite product of commutators in SU{K), by the 
fact that the commutator group of SU (K) is equal to SU (i^ jj. 

Consider then an element X in the eo/2-net which is written as a product of m commutators. All 
together X is written as a product of 4m transformations from SU{K). By Theorem 17.21 the set of 
normalized crossing operators generates a dense subgroup in SU{K). Therefore by approximating every 
element in the product to within eg/ (8m), we can approximate X to within eo/2 by a finite product 
of normalized crossing operators. If we now replace all the elements in the eo/2-net with their eo/2 
approximations we get an eg-net that consists of finite products of crossing operators. As we shall see 
shortly, the important thing is that because we use commutators, then each element in the new net 
contains, for each normalized generator, its inverse. 

We can now apply the Solovay-Kitaev Theorem for this eo-net. This implies that we can approximate 
y_ by poly (log e^^) normalized operators with weights taken from W/d, where each normalized operator 
appears in the approximating sequence the same number of times as its inverse. 

The operators we actually have to use are the unnormalized ones. Since ai = det(cri)T4 (f^^ we get 
that we have to calculate the product of the determinants of all the crossing operators involved, and then 
take the power 1/14. To prove the theorem, we need to show that this factor is indeed equal to Ar, the 
product of all the odd weights vjd that appears in the approximation sequence. This is where we use the 
fact that for each generator we also have its inverse. 

The determinant of a product of matrices is equal to the product of the determinants; to calculate 

^This follows from the following argument: the commutator group is a normal closed subgroup of SU (K), which contains 
an infinite number of elements. Now SU{K) is an "almost simple Lie group", that is, the quotient group of SU{K) divided 
by its center is a simple abstract group. In addition, it is also a connected group. It follows that any normal closed subgroup 
of it must either be in its center (hence, bo finite) or be the whole SU{K), and therefore the commutator group of SU{K) 
is equal to SU{K). 
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the determinant, we group together the unnormalized versions of ai and a~ . For i even, the product 
of those unnormahzed operators is 1, by Eq. (j50|) . For i odd, the product of ai{v/d) and Uiiw/d) for 
v^w being inverses with respect to q, is equal to ^1, by Eq. The determinant of this operator, 

calculated on K, is (■^)^'^. Taking the power 1/14, we get that the contribution of this pair to the overall 
factor is iiH = £H which is indeed the correct contribution to Ar- 

q a a ^ 

We now address the complex non-unitary case. The proof follows exactly the same lines as for the 
unitary case, and is based on the non- unitary version of the Solovay-Kitacv theorem. Theorem 17.61 (The 
only differences arc minor technicalities, as we shall see). We fix R in the non unitary Solovay Kitacv 
theorem to be 3, so that Bp){K) contains SU{K, (D) (this follows from the fact that any unitary matrix 
is within distance 2 from the identity). We consider an eo/2 net in Bfi(K), where each element X in the 
net is a product of m commutators, as before (we use the fact that the commutator group of SL{K, (D) 
is equal to SL{K, (D), just like in the unitary group). Hence, each element is a product of Am elements 
in SL{K, (D). By Theorem 17.31 the normalized generators generate a dense subgroup in SL(K, (D), and 
so we can approximate each of the 4m elements to within eo/TO3™, using a sequence (of bounded length) 
of the normalized operators. The improved accuracy of eo/m3"' is required because of the non-unitarity, 
which makes the error non-additive. It is never the less easy to bound the error using a telescopic sum, 
and get that the product of the approximating sequences is within (eo/?Ti3™)m3™~^ < eo/2 of the net 
element. The rest of the argument is the same. 

For the real non-unitary case, the argument is the same, except that unitary matrices are replaced by 
orthogonal matrices. □ 

It is quite easy to finish the proof of Theorem 17.11 using the density and efficiency theorem, and this 
is what we will do next. This will complete the proof of hardness, except that it remains to prove the 
density theorems; as mentioned before, this will be done in Sections |5] and O 

7.5 Proof of Theorem 17.11 

Suppose we are given the quantum circuit (j42p over n qubits, which applies the unitary matrix U. We 
will describe an efficient reduction from approximating the Tutte polynomial to evaluating the outcome 
of this circuit. Specifically, the reduction constructs a weighted graph G = (V, E) of the form that is 
given in the theorem, such that an additive approximation of ZG(q,-v), within the window size of the 
theorem, will enable to decide whether | (0®"| U |0®") |^ < 1/3 or | (0®"| U |0®") |^ > 2/3. 
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7.5.1 The reduction: The unitary and the complex non-unitary case 

For now, we restrict attention to the case of unitary or complex non-unitary sets of parameters. The real 
non-unitary case requires some additional step, and we will deal with it at the end of this subsection. 
The reduction constructs the graph G as follows. 

1. For every one of the N gates Ui in Eq. (|^^ . use the density and efficiency theorem (Theo- 
rem 17. 7|) with q = (P and weights W to find tangles % G GTL{d) of poly logarithmic size such 
that \\A'j'^p(Ti) — C/jllig < 1/lOOOA^. As N ~ poly{n) this step is done in poly (n) time. 

2. Every tangle 71 G GTL(d) is then padded with an appropriate number of straight strands from its 
left to match the original two local qubits on which Ui operates. Specifically, if Ui operates on the 
qubits j, j + 1 then pad % with 4(j — 1) strands. It is easy to verify that now \ \A'^^ p{Ti) — U_^\\l^^ < 
1/lOOiV. In other words, we now use the L^n norm instead of the Lg norm. 

3. The tangle T = T/v ■ . . . ■ Ti is calculated. 

From the assertion \ \A:j-^ p{T,) ~ U^Wl^^ < 1/lOOOiV, and using a standard telescopic argument, we 
have ||A^^p(T) — IZ||l4„ < 1/100, where Ar Yi^Ti is the product of all odd weights v/d that 
appear in T. Therefore, as lO**") G L4n, we get 

A^^ (0®" I p{T) 1 0^") - (0^" I U 1 0®") < 1/100 . (58) 

4. To calculate (0^"| p{T) |0®") we define a new GTL{d) element Tp = B-T -Ahy muhiplying a 2n 
cups element Ahy T and then by a 2n caps element B. 7p has the form of the medial graph that 
was defined in the hardness problem (see, for example. Fig. [S]). We can therefore find a weighted 
graph G = {E, V) whose medial graph Lq corresponds to Tp, and its weights are taken from W. 
Notice that in accordance with the conditions of the computational problem, every e G Eodd in G 
has a weight taken from Wodd and similarly for the even case. 

We now prove 



Claim 7.3 The graph generated by the above reduction satisfies 

1 



^hard 



-ZG(q,v)-(0®"|f/|0®") 



< 1/100 . (59) 



It follows from this claim that it suffices to provide an additive approximation of Zciq^'v) to within 
Ahard/lO, in order to be able to distinguish between the two cases: | (0®"|t/|0®") p > 2/3, | (0®"|[/|0®") p < 
1/3. We now prove the claim. 
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Proof: According to Proposition 14. 1[ Tp = {La)!, and so p{Tp) = {Lq)\. As p{Tp) is an operator from 
Hi to Hi, it follows that (1| p{Tp) |1) = {La). 

Let us now evaluate (O**"! p(T) lO**"): returning to the identity Tp = B ■ T ■ A, vje have p{T:p) = 
p{B)p{T)p{A) with p{A) : Hi i?4„, p{T) : Hi^ ^ i?4« and p{B) : H^^ ^ Hi. From Sec. E] we see 
that: 

p{A) = (a2i)'" 1121212... ) (1| =d"|0®")(l| , (60) 
p{B) (6i2)2" |1) (121212... I =d"|l)(0«5"| . (61) 

Here |1212 . . .) corresponds to the "zig-zag" path 1^2^1^2.... It is the only path in H^n that 
is compatible with the path |1) e i^i with respect to the tangles A and B. 
We conclude by Claim [3A] that 

(r"|p(T)ir"> = rf-'"(iiP(Tp)ii> = d-2"(£G) = 9-"-i''i/'^G(g,v) , (62) 

and therefore by Eq. we get 

A^ig~"-l^l/2ZG(g,v) - (0®"| C/|0®") < 1/100. (63) 

Consider now the factor A^^^/^""'^'/'^ that multiplies Zc{q,'v). We claim that it is exactly l/Ahard 
for Afiard defined in Eq. (|43|) . Indeed, by the definition of Aj- it is equal to 

q-n-\V\/2 ( II ^\ = q\E^,,\/2-n-\V\/2 ^-1 ^ (g^) 

In addition, it is easy to verify by going from the medial graph back to G that \V\ — 2n + \Eodd\, and 
therefore the factor is simply 



y 11 A 

^ ^hard 



as required. 
□ 

7.5.2 The real non-unitary case 

We now have to deal with the real non-unitary case. The problem is that in this case, the density and 
efficiency theorem, only guarantees that we can approximate any gate in the orthogonal group 5*0(4), 
whereas the gates of the quantum circuit are in S'J7(4). We therefore start the reduction by first translating 
the quantum circuit using unitary gates in SU (4) to an equivalent circuit over n + I qubits whose gates 
are real matrices in 50(4). The reduction from unitary gates to orthogonal gates is fairly standard: 
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Claim 7.4 Universal quantum computation can be performed using an array of qubits, on which two 
qubit gates taken from the orthogonal group are applied on nearest neighbor qubits only. Moreover, any 
quantum circuit can be translated to a circuit of this form with only polynomial overhead. 

Proof: Recall the proof in [?] that real matrices suffice to achieve universality in quantum computation; 
This is done by adding an extra qubit which carries the information of whether the state is in the 
imaginary part of the space or the real part. We now use this idea in the following way. We will start 
with a quantum circuit (not necessarily one dimensional) which uses only one qubit gates plus CNOT 
gates - which is well known to be a universal set. 

We translate those gates to their real versions, as is done in [?]. We note that the CNOT gates have 
not changed: in the new version, a CNOT gate goes to a CNOT tensor with identity on the extra qubit. 
The one qubit gate have now become two qubit gates. Hence, we have expressed the circuit using two 
qubits gates in the orthogonal group (namely, real). The only problem is that they may not be applied 
to nearest neighbors. We solve this in the standard way by adding linearly many SWAP gates for any 
gate; luckily, the SWAP gate is in the orthogonal group and is a two qubit gate, so we are allowed to use 
it. □ 

8 Density in The unitary case 

The proof of the density in the unitary case essentially follows the same lines of Sec. 4 of Ref [7]. For 
completeness, we will provide most of the proof from scratch, as we will build on this proof in the non- 
unitary case. We will be using two basic tools from Ref [7], without reproving them: The bridge lemma 
and the decoupling lemma. 

8.1 The overall structure of the density proof 

We now want to show density of the group generated by the normalized operators. Recall that the space 
in which we want to prove density is spanned by 14 vectors, which are denoted by \pi) and arc shown 
schematically in Fig. \lU[ 

Let us analyze the action of the normalized crossing operators a-i{u) on these vectors. From Eq. (|48|) 
we deduce that ai{u) has the same block structure of The latter has a very simple structure: 
denoting by |. . .jkl. . .) a path whose i, i + 1 steps are j — > fc ^ we find that mixes the vectors 
I- • ■ Ji J + Ij J ■ • ■) s-nd |. . . j, J — 1, j . . .) while returning zero whenever £ j or k ^ j ± 1. When j = 1, 
the path |. . . j, j — 1, j . . .) does not exists, making \ . . . j + I, j . . .) a non-trivial eigenvector of $i. 



51 



Therefore works on 1 x 1 and 2x2 blocks of K. In Table [T] we list the blocks on which the different 
<f>i's are non-zero. 



$1 


(1) 


(3) 


(5) 


(7) 


(9) 


$2 


(1,2) 


(3,4) 


(5,6) 


(7,8) 


(9,12) 


$3 


(1) 


(3) 


(6,10) 


(8,11) 


(12,13) 


$4 


(1,5) 


(2,6) 


(3,7) 


(4,8) 


(13,14) 


*5 


(1) 


(2) 


(7,9) 


(8,12) 


(11,13) 


$6 


(1,3) 


(2,4) 


(5,7) 


(6,8) 


(10,11) 


$7 


(1) 


(2) 


(5) 


(6) 


(10) 



Table 1: The block structure of the operators in K ^ iJg.i^i. Vectors that do not appear in a certain 
row vanish by the action of the corresponding <f>i . 

Consider then the group that is generated by (Ti , (T2 . Table [T] shows that there are five 2x2 blocks 
on which these operators act non-trivially: 

• ,b2)} 

• {Ips) ,b4)} 

• {\P5) ,\Pe)} 

• {|P7) ,b8>} 

• {|P9),bl2)} 

The following subsection shows that under the conditions of Theorem l7.71 ai , a2 generate a dense subgroup 
of SU{2) inside each block. We call this the seed; once we have a seed of dense unitary group on some 
small subspace, climbing up in the dimensionality can be done using fairly general tools. 

8.2 Constructing the Seed: Density in SU{2) 

We start by generating the unitary group on a two dimensional subspace. 

Lemma 8.1 (The unitary seeding lemma) For q and W as in the unitary set of parameters, the 
normalized operators (Ti,(72 generate a dense subgroup of SU{2) in each of the two-dimensional blocks 
that are listed above. 
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Proof: Fix a particular SU{2). Observe that &i,(T2 are not in the SU{2) group even though they are 
in SU (K) . This is because their determinant in each block need not be equal to 1 in order for the 
determinant in K is equal to 1 . Let ri , T2 be the operators normalized so that their determinant in the 
two dimensional subspace is 1. We will show that ti,T2 and their inverses generate a dense subgroup in 
SU{2). By the fact that the commutator group of SU{2) is equal to SU{2), it is sufficient to approximate 
commutators of SU{2), and thus, in each approximating sequence we have for every ti also its inverse. 
Thus, if we replace by di and likewise for its inverse, we will not change the product. 

Hence, it suffices to prove density of Ti,r2 and their inverses. According to the reasoning of Theo- 
rem 4.1 in Ref [7], which is based on Theorem 5.1, page 262 in Ref [T7], it is sufficient show that ti and 
T2 are non-commuting and generate an infinite group. The first part is simple; ri and T2 commute if and 
only if |ti,T2]] = |(Ti,(72]1 = 1. A straightforward calculation yields 

T.([...2l-l)^-^(/3-i)'(a-l)\ (66) 

Obviously, the two matrices are non-commuting provided that 9 7^ 1 and a, /3 7^ ±1 - all of which cannot 
happen for the unitary family of parameters. Definition 17.61 

To prove the other condition we use the canonical homomorphism ip : SU{2) 50(3) for which 
kercp = {1, —1}, and the fact that all finite subgroups of 5(9(3) are well known and have been classified. 
Let us denote by G the subgroup which is generated by ri, T2 and assume that it is finite. In such a case, 
f{G) is a finite subgroup of 50(3), and must be one of the following five subgroups: 

1. The cyclic group Z„, for some n> 1. 

2. The Dihedral group _D„, for some n> 1. 

3. The Alternating group A4. 

4. The Permutation group 54. 

5. The Alternating group A^. 

Each one of these possibilities is contradictory under the conditions of the theorem. For start, the 
eigenvalues of ti are a and 1/a which are equal to e*"*^. Similarly, the eigenvalues of T2 are e*"*^. 
Therefore if either si or S2 are irrational then the period of ti or T2 is infinite and consequently G and 
(/5(G) will be infinite. 

Assume then that si = pi/qi and S2 = P2/12 with gcd(pi,(7i) = gcd(p2,92) = 1 and that gi > 5 
(the case of (72 > 5 is similar). Then it is easy to verify that the period of ^p{ti) is qi > 5. However, 
the maximal period in A4, 54 and A5 is 3,4 and 5 respectively, hence ^p{G) is not one of these groups. 



53 



<y9(G) is also not equal to Z„ because that would imply that there exists an element U E SU{2) and two 
integers < £,m < n such that ri kcT(p = U^kcT(p and T2 kcr</7 = ?7™ker(/3, but as keri^s — {±1}, this 
would imply that ri,T2 commute. A similar argument can be applied to the D„ case. □ 

8.3 Building up in the Dimensionality 

Let us now consider what happens when wc also act with (T3. Table [T] tells us that (73 mixes |pio) with 
\pg), while, we already know that ai and (T2 generate a dense group in the SU{2) of spa'n{\p^) , jpe)} 
direct sum with the identity on |pio)0 The three operators therefore operate inside the space C 
span{\p;^) , \pq) , Ipio)} which is a direct sum of two subspaces: A span{\p^) , \p^)} and B =' spa7i{|pio)}. 
We can generate SU{A) and SU{B) (which is trivial), and wc have a transformation that bridges these 
two spaces and leaves C invariant, where we consider again the normalized version of (T3, that has deter- 
minant 1 on C . The following lemma assures us that in such case we can also generate a dense group in 
SU{C). The lemma is proved in Appendix A. 2 of Ref [7]. 

Lemma 8.2 (The unitary Bridge lemma Consider a linear space C which is a direct sum of 
two subspaces A and B, and assume that dimi? > dim A > 1. Let W G SU{C) be a transformation that 
mixes the two subspaces, i.e., W{B) ^ B. Then any U G SU{C) can be approximated to an arbitrary 
precision using a finite sequence of transformations from SU{A), SU{B) and W . Consequently, the group 
generated by SU{A), SU{B) and W is dense in SU{C). 

Using once again similar commutator reasoning, we can get rid of the normalizing factor and deduce 
that the three generators &i, gi, g-^ span a dense subgroup of SU{C). 

Returning to the space K, identical reasoning shows that we can also generate SU{3) over the sub- 
spaces span{\pr) , \ps) , bii)}, and span{\pg) , \pi2) , bi3)}- 

Let us now consider the (74 transformation. Table [T] reveals that it bridges the subspaces span{\pi) , 
1^2)}, and span{\p5) , \pq) , |pio)}, where we already have SU{2) and SU{3) densities respectively. We 
would therefore like to use the Bridge lemma to show that we have a SU{5) density on the direct sum 
of these spaces. However, the Bridge lemma cannot be directly applied here because it assumes that we 
can generate SU(2) and SU{3) independently of each other, whereas the transformations in these two 
subspaces are generated simultaneously by cTi,a2,G3, and hence might be coupled to each other. The 
following lemma comes to our rescue; it guarantees that we can always approximate any transformation 
on span{\pi) , \p2)} while leaving span{\p5) , \pe) , |pio)} invariant, and vice versa. The proof can be foimd 
in the A. 2 appendix of Ref [7]. 

^Notc that 6'i,CT2 do not leave |pio) invariant, but instead give it a phase. This phase, hovfover, is ehminatcd when we 
consider approximating commutators, in which operators appear the same number of times as their inverses. 



54 



Lemma 8.3 (The unitary decoupling lemma [7]) Let G he an infinite countable group, and let A, 
B be two finite Linear spaces with different dimensionality. Let Ta and Th be two homomorphisms of G 
into SU{A) and SU{B) respectively and assume that Ta{G) is dense in SU{A) and ti,{G) is dense in 
SU{B). Then for any U G SU{A) there exist a series {tTn} in G such that 

TaK) ^ U (67) 
n{<Jn) ^ 1 , (68) 

and vice versa. 

From here onward, the proof is straightforward. Together with (T4, we generate a dense SU{5) 
on span{\pi) , \p2) , jps) , \pe) , bio)} and on span{\ps) , \p4) , \pr) , \ps) , bii)} and S't/(4) on span{\pg), 
\pi2) 1 Ipis) , \pi4)}- Finally, (T5 and o-g bridge these three subspaces, enabling us to generate a group 
which is dense in SU{K) ~ SU{li). This completes the proof of Theorem 17.21 for the unitary case. 

9 Density in the Non-Unitary Cases 

The proof follows the same general structure of the unitary case, but with fundamental differences. It 
uses a modified versions of Lemmas 18. 1118. 31 to the non-unitary case. The main difference is in the seeding 
lemma. Whereas in the unitary case, the generic behavior is that the crossing operators generate a 
dense subgroup, in the non-unitary case this is no longer true. Density is much more difficult to achieve, 
and requires restricting the parameters severely. Naturally, the techniques for proving density are very 
different than in the unitary case. We use results by Jorgensen |36j and Sullivan [37], which enable us to 
show density in a restricted set of parameters. Wc note that providing a full characterization of all cases 
which are dense remains open. 

Once density is established in the seeding lemma, the methods that enable us to climb in dimensionality 
to the entire subspace K are fairly similar to the unitary case, though technical modifications are required 
in the proofs of the bridge lemma and the decoupling lemma. 

9.1 Constructing the Seed: density in SL{2,€) and in SL{2,K) 

It is the seeding lemma that truly distinguishes the unitary from the non-unitary cases. The behavior of 
the two cases is entirely different. As a result, the proof uses different techniques, and applies to a much 
more restricted set of parameters. The main result of this section is: 
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Lemma 9.1 (The non-unitary seeding lemma) For q and W as in the complex non-unitary case, 
the normalized crossing operators di,&2 generate a group which is dense in SL{2, (D) in each of the five 
2x2 blocks in K on which they act non trivially: 

1- {|pi>,b2>} 

2- {b3>,b4>} 

3- {b5>,b6)} 

4- {b7>,b8>} 
5. {b9),bi2>} 

For q and W as in the real non-unitary case, the group is dense in SL{2, R). 

Lemma [9.11 is essentially an application of two results by J0rgensen [36] and Sullivan [Sj- To state 
them let us first define the concept of elementary subgroup of SL{2, (D). The full definition of this object is 
intimately connected to the theory of complex Mobius transformations, for which a standard introduction 
can be found in Ref [10] . Instead of dwelling into this rich and beautiful theory, we shall quote a theorem 
by Baribeau and Ransford from Ref [41] that gives a necessary and sufficient condition for a subgroup of 
SL{2, (D) that is generated by two elements X,Y ^ SL{2, (D) to be an elementary subgroup. 

Theorem 9.1 (Proposition 2.1 from Ref [41]) Let G = {X,Y) be a subgroup of SL{2,<D), and de- 
fine the following 3 complex numbers: 

T = Tr^{X)-4., T' = Tr^{Y)-A, 7 Tr2([X, FJ) - 2 . (69) 

Then G is an elementary group if and only if one of the three conditions hold 

• t,t' e [-4, 0] and 7 e [-rr74, 0] , 
. 7 = , 

• T = 7 and t' = —4, or t = —4 and r' = 7, or t = —4 and r' = —4 . 

With this "definition" of an elementary group, we can now state the J0rgensen inequality: 

Theorem 9.2 (J0rgensen, [36]) If two matrices X,Y ^ SL{2,<C) generate a non- elementary and dis- 
crete group, then 

I Tr^{X) - 4| + I Tr{XYX-^Y-^) - 2| > 1 . (70) 
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Finally, we use a well-known result of Sullivan, which can be restated in the following form 

Theorem 9.3 (Proposition in Section 1 of Sullivan, [37|) Let G be a non- elementary and non- 
discrete subgroup in SL(2, (D). Then one of the following holds: 

• G is dense in SL{2, (D). 

• The connected component of the identity of the topological closure of G is conjugate to 5*^(2, R), 
and consequently G has a subgroup which is conjugate to a dense subgroup of SL(2,M.). 

A direct corollary of these three results is then 

Corollary 9.1 If X,Y G SL{2,<C) are two non- commuting, diagonalizable matrices with eigenvalues 
{a, 1/a} and {(3, 1//?} respectively, such that \a\, \j3\ ^ \, and if either 



then {X,Y) is dense in SL{2,<C), or {X,Y) contains a subgroup which is conjugate to a dense subgroup 



Indeed it is easy to verify that the condition on X, Y guarantees, according to Theorem l9.H that {X, Y) is 
a non-elementary group, hence using the J0rgensen inequality in conjunction with Sullivan's result gives 
the corollary. 

Returning to the proof of the seeding lemma, let us now consider, the action of ai and (T2 on the 2x2 
block of {\pi) , \p2)} (the structure of these two transformations in the other blocks is identical). There 
we have 



I Tr^{X) - 4| + I Tr{XYX~^Y-'^) - 2| < 1 , or 



(71) 



I Tr^{Y) - 4| I Tr{XYX-^Y-^) - 2| < 1 



(72) 



o/S'i(2,]R). 




(73) 



and consequently 




(74) 



(75) 
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with a, (3 that are defined in Theorem 17. II A straight forward calculation then gives 

2 



\Tr^ai - 4| 
\Tr^a2 - 4| 
|TrIai,<72]-2| 



1 

a 

a 



9 



1 





1 


2 


1 




a 




a 




a 




a 



(76) 
(77) 
(78) 



and plugging this into the previous Corollary proves most of the seeding Lemma. 

What remains to show is that for W in Case (II) we have a SL{2, (D) density while for Case (III) we 
have a SL{2, R) density. To prove the first claim we use the following lemma: 

Lemma 9.2 If H ~ gSL[2,J{)g^^ is a normal subgroup of G then the trace of every f £ G is either real 
or is purely imaginary. 

Proof: Define h = g^^fg. Then Tr{h) = Tr{f) and hSL{2,'R)h-^ = S'i(2,]R). Therefore for every 
X g S'i(2,]R, hxh~^ is a real matrix, hence h*x(h^^)* ~ hxh~^. Therefore if we define q — h^^h* then 
qxq* = X. This equality should hold for every x € SL{2, R), and in particular to the family of 



A e R 



(79) 



A 

-1/A 

Then it is a simple exercise to verify that q must be equal to 1 or —1. Consequently, h = ±h* and 
therefore Tr{h) is either real or purely imaginary. □ 

Now assume that W is in case (II), yet the G = ((5'i,(72) is not dense in SL{2,(C). Let G denote 
the topological closure of (0-1,(72). Then the connected component of the identity of G is conjugate to 
5i(2,R). and is also a normal subgroup of G. Therefore the trace of every element of G must be either 
real or purely imaginary. However, if vi, V2 or q are not strictly real then it is easy to verify that Trdi, 
Tr&i, or rr((Ti(T2), are not real or purely imaginary, which is a contradiction. 

Lastly, if W is in case (III), yet G — ((Ti,o'2) is dense in SL{2,<C), then we can continue the proof 
of Theorem 17.11 in exactly the same manner as in case (II). We would deduce that for every unitary 
circuit U which is given by Eq. we could find a graph G = {V, E) with weights in W such that 



-2n 



(n 



Zg((7, v) approximates (0*^"! U |0®") up to 1/100. However, under the conditions of 
case (HI), the former expression is purely real whereas the latter can be any complex number in the unit 
disc. Therefore G has a subgroup which is conjugate to a dense subgroup of SL{2,'^). Finally, note 
that the conjugation matrix must be trivial since for W in case (HI) the crossing operators in K are real 
matrices. This concludes the proof of the Lemma [9. II 
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9.2 Building up in the Dimensionality: the non-Unitary case 

The methods for building up in the dimensionahty, once we have the seed, are much more similar to 
those in the unitary case, even though, there too, there are many complications; however, those seem 
more technical and less fundamental than in the case of the seeding lemmas. We delay the proof of the 
non-unitary bridge and decoupling lemmas to Appendices IA.HIA.2l Given those lemmas, the proof of 
density follows the almost same lines as in the unitary case. 

Lemma 9.3 (The non- unitary bridge Lemma) Let C be an Hilhert space over the field F = R or 
F ^ (C, and Let A, B be subspaces of C such that C = A® B with dimi? > dim^. 

Let W G SL{C,F) be a bridge transformation between A and B, i.e., WA ^ A and WB 7^ B. 
Furthermore, let SL{A, F) denote all the dot = 1 transformations that act on A while leaving B invariant, 
and similarly SL{B,F) act only on B while leaving A invariant. Then SL{C',F) is generated by W, 
SL{A,F), and SL{B,F). More over, every element in SL{C,F) can be represented as a finite product 
of elements from SL{A, F), SL{B, F) and W. 

Lemma 9.4 (The non-unitary decoupling Lemma) Let A, B be linear spaces (over F — (C or F = 
Rj, and let G be an abstract group. In addition, let pA '■ G ^ SL{A,F) and pB '■ G SL{B,F) be 
two homomorphisms, and assume that Pa{G) is dense in SL{A,F) while Pb{G) is dense in SL{B,F). 
Finally, assume that if PA{9n) is a Cauchy series in SL{A,F) then there exists a subseries for which 
Psigrik) is Cauchy in SL{B,F) and vice versa. 
Then the following holds: 

• If dim A 7^ dim_B then we have a full decoupling: for every Va S SL{A,F) and Vb G SL{B,F) we 
can find a series {gn} G G such that pAidn) Va while PB{gn) ^ Vb- 

• If Ami A = 61m B then either we have a full decoupling, or, for every series {gn} in G for which 
PA{gn) — > 1 we can find a subseries {gnk\ fof which PB^guk) converges to some element in the 
center of SL{B, F), and vice versa. 

Notice that in order to use the bridge Lemma in case (3), we must be sure that the bridge operators are 
in SL{K, R). This is where the condition q > 4 cos^ tt/5 is needed: recall from definition 15 .41 that the path 
representation for d — ^/q, is determined by the coordinates of the infinite eigenvector n of the auxiliary 
graph Foo for eigenvalue d. The first five coordinates are given by tt = (1, d, d^ — 1, d^ — 2d, d^ — Sd^ + l, . . .), 
and it is easy to see that for d > 2 cos7r/5 they are all positive. These are the coordinates that define the 
<f>i operators over K . Their positivity implies that the matrices of $i in K are real. 
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If in addition to that, vi, V2 (and consequently wi, W2) are real then it follows that crossing operators 
have real matrices in K. The normalized crossing operators in that case will have matrices which are 
either real (when the determinant of the un-normalized operator is positive) or purely imaginary (when 
the determinant is negative). In the first case, they belong to SL{K, R), while in the second case we can 
always use the square of matrix, which will be in SL{K, R), as a bridge instead of matrix itself. 

Another important difference between the unitary and the non-unitary cases is found in the extra 
condition of the non-unitary decoupling lemma: to use that lemma, we must assume that for any element 
in SL{A) we can find a series that converges to it while also converging in SL{B), and vice- versa. 

This always happen for the following reason: we start the proof by looking at the subgroups that are 
generated by cti and (T2. These are either 2x2 subgroups that arc dense in SL{2), or they are trivial 
unit transformation on the one dimensional subspaccs. But since every 2x2 subspaces are isomorphic 
to each other by conjugation (this follows from the fact that cti and (72 look the same at every subspace 
up to a change of basis), it follows that at least at that initial step, our subgroups fulfill the requirements 
of the non-unitary decoupling lemma. 

But what about the next steps, after we use the bridge lemma? Also in those steps, the subgroups in 
question always fulfill that extra condition. This follows from the fact that the non-unitary bridge lemma 
promises us that it is possible to generate every transformation in SL{A © B) using a finite number of 
transformations from SL{A), SL{B) and the bridge W . Therefore, if the extra condition holds for SL{A) 
and SL{B) it must also hold for SL{A ®B).. 

This completes the proof of the density theorem in the non-unitary cases. 

10 BQP-Completeness: matching the different sizes of the ap- 
proximation window 

In this section we will match the algorithmic scale of Sec. [6] with the hardness scale of Sec. [71 thereby 
proving completeness. As stated previously, the two scales do not match in all cases; a straightforward 
matching can be done only in the unitary case. In the non-unitary case, a slight modification of the 
computational problem has to be done first; the so-called "grouping" of the crossings. We will show that 
in this form, the approximation of the Tutte polynomial for g = 3 in the non-unitary cases and weights for 
which we have hardness, is also BQP-complete We believe that our techniques can be further generalized 
to many other values of q. Such a generalization, however, is outside the scope of this paper. 
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10.1 Completeness in the unitary case 

In the unitary case, the computational problem of Theorem lG.ll is BQP-complete. Formally, we have the 
following theorem: 

Theorem 10.1 (BQP-completeness in the unitary case) Consider a unitary set of parameters {q, W), 
as in Definition \ 7. 6] Then the following computational problem is BQP-complete: 

Input: A graph G = (i?, V) which is nicely embedded in R^, with edge weights from W . 

Output: Approximate the multivariate Tutte-polynomial ZQ[q^'v) to within an additive approximation 
^aig{G) / poly{\V\) , where ^aig is given in Definition \6.3[ 

Proof: By Theorem 16.11 the above problem is in BQP. It remains to show that it is also BQP-hard. 
Consider then the conditions of Theorem 17.11 We will show that Aaig — ^hard for the special type of 
graphs that are used in the BQP-hard problem. 

Assume then that G = {V, E) a graph that is used in the BQP-hard problem, and consider its medial 
graph Lq. It is a concatenation of 2n cups, followed by poly{n) crossings (with weights taken from W), 
which arc then closed by 2n caps (sec Fig. [S]). Let us calculate the norm of each one of these elements. 
We begin with the cups/caps: 

Lemma 10.1 For an Hermitian representation, the norm of the cup (cap) operator is exactly \fd — g^/*. 

Proof: For every path p and a cup element Ai we have 

||p(A;) \p) IP = (p| \v) = {v\ Pmp{A^) \p) = (p| p{B,A^) \v) = (Pl d\ \p)^d. (80) 

The second equality follows from the Hermiticity of the representation (Definition 15. Sp . whereas the forth 
equality follows from the fact that BiAi is simply a loop, hence is equal to dX. Therefore ||/o(.4i)|| = \fd. 
Similarly, for p(Bi) we have 

||p(S0b>ll' = (p|p(A60b) , (81) 

But AiBi = £i, hence p{AiBi) = which, for Hermitian representation, is an Hermitian operator with 
eigenvalues d, (sec Sec. I7.3.2|) . Consequently, ||p(Si)|| ~ \fd as claimed. □ 

Returning to the main proof, we are left with poly(n) crossings of two types, odds and evens. Odd 
crossings are crossings between i ^ (i -I- 1) strands with odd i. Recall from Definition 17.41 that the 
operators that correspond to these crossings are a(u) — u\Ar^i with eigenvalues \u + d, u\ for u e W j d. 
In addition, the unitarity condition implies that ju + d| = and therefore the norm of the crossing 
operators is 

||cr(w)|| = |u| = , odd case. (82) 



61 



Similarly, in the even case (j(u) ~ 1 +u<I> with eigenvalues {1,1 + ud} and a unitary condition |1 + = 1 
that implies 

||cr(u)|| = 1 , even case . (83) 
Gathering all terms, we find that the product of all norms is 



,n-|B„ddl/2 



n 



hence 



A, 



alg 



\V\/2+n-\E,aa\/2 



n 



(84) 



(85) 



Finally, as already noted in the proof of Theorem l7.11 the particular structure of G yields \V\ = 2n+\Eodd\ 
hence 



Aa/g - 



JV-l-ISoddl 



n 



— ^hard ■ 



(86) 



□ 



10.2 Completeness in the non-unitary case 

Before giving an exact definition of the completeness result in the non-unitary case, let us try to explain 
the problem in a somewhat heuristic manner. 

10.2.1 Why the scales do not match in the non-unitary case 

As mentioned in the beginning of this section, in the non-unitary case, it is no longer true that Aaig = 
A/iard- To understand this, let us try a naive approach to prove the equality. 

Re-examining the argument that lead to Eq. ([55]) in Sec. [71 reveals that when we construct the GTL{d) 
element that corresponds to a quantum circuit [/, wc find elements % such that 



||p(Ti-T2-...)-Ar^||i,„ < 1/100. 



(87) 



Note, however, that unlike the notation in Eq. here we consider % to be the elementary crossings 

rather than the product of crossings that results from the density and efficiency theorem (Theorem 17. 7|) . 
In the unitary case, we examined every crossing and concluded that 



MT,)\\-\\p{%) 



= |Ar| 



Then after taking into account the norms of the cup and caps operator, it easily follows that concluded 
that Aojg = l^hard- Can we do the same in the non- unitary case? Equation ([57)1 gives us 



\\p{ri-% 



(89) 
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where ~ means equality up to an order of unity. We would like to show that the LHS is equal to 
||p(7i)|| • ||p(72)||. This is, however, not true in the unitary case, and the reason is twofold: firstly, in the 
non-unitary case, the norm of a product of operators is always smaller than or equal to the norm of the 
product. Therefore \\p{Ti • T2 • . . .)||l4,. < IIp(^i)I|l4,. ■ IIp(^)I|l4,. ■ ■ • ■• Secondly, the norm over L4„ is 
always smaller than or equal to the overall norm: • ||p(^)||l4„ ■ • ■ • < ||jo(Ti)|| • ||p(72)|| • . . .. 

10.2.2 Motivating the Solution 

Our way to overcome these problems is to take the product of elementary tangles that a nicely embedded 
medial graph defines, and group neighboring elements: 

<Si 52 53 

LG^TC~".-rC~^--TC^-^Si-S2-S:i- ... , (90) 

where 

S,'^%^-...-%^^,., . (91) 

In other words, the quantum algorithm applies the matrices that correspond to Sj, rather then applying 
the matrices that correspond to each % individually. The new algorithmic scale is thus the product of 
||p(5j)|| instead of the product of the norms of the individual operators p{Ti). ||p(5j)|| is smaller than or 
equal to the norms of the operators that make up this product. This forces us to change the definition of 
the computational problem by adding the grouping information to the input to the problem. The grouping, 
however, must be done in such a way that the resulting operators p{Si) still work on logarithmically many 
strands, so that we can first compute the exact matrix form of the operator efficiently, and then simulate 
the relevant operator efficiently by the quantum algorithm. 

The grouping of the operators gives us a lot of additional power. Even if the crossing operators are non 
unitary, we may still group them into an almost unitary (up to an overall constant) operators, thereby 
bounding their norm. This is precisely the idea of the non-unitary completeness. After all, the density 
and efficiency theorem 17.71 provides us with a natural grouping of the crossing operators. It gives us a 
product of a polylogarithmic number of crossing operators that approximate a unitary gate Ui . It seems 
that if we use this grouping as an input to the quantum algorithm, the grouped operators p{Si) will 
become (approximately) unitary (up to an overall constant), and we will be able to prove completeness. 
There is a small caveat here, however. The product of the crossing operators approximate the (encoded) 
unitary gate only over the subspace K — iJg.i^i of the 8-strands space Hg. In order for the product to 
be (approximately) unitary, we must "fix" its behavior on the other subspaces that make up . We must 
make sure that the eigenvalues of p{Si) have (approximately) the same magnitude in all the subspaces of 
Hg. Changing the definition of the computational problem is therefore not enough; we must also improve 
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the result of the density and efhciency theorem. The exact definition of the new computational problem, 
and the necessary modifications to the universality theorem are now given below. 

10.2.3 Statement of Results 

Definition 10.1 (A nicely embedded graph with grouping and its scale) Consider a graph G = 
{V, E) whose medial Lq is nicely embedded in R^. The embedding maps Lq to a product of elementary 
tangles (cups, caps and crossings): Lq t-^ 71 ■ Tz ■ . . . ■ %. A grouping of these elements is a partition of 
the product into sub-products Sj = Tij ■ . . . ■ Tl-^-^^i, as shown in Eqs. i90\ \ 91\) . The partition must be 
such that any p{Sj) operates on the tensor product of at most polylogarithmically many local registers. 
The scale of such a grouped graph is then 

\rp = gl^l/' • ||p(5i)|| • \\p{S^)\\ \\p{Sm)\\ . (92) 

The universality result for the non-unitary case can now be stated exactly, 

Theorem 10.2 (BQP-completeness in the non-unitary case) For q = 3 and a corresponding non- 
unitary set of weights W (either complex or real, as given in Definitions \ 7. 9\ 7.10^ , the following problem 
is BQP -complete: 

Input: A nicely embedded graph G = {E, V) with grouping. 

Output: An additive approximation of ZQ{q,w) to within an error of /S.grp/poly{\G\) 

To prove the theorem, we will need a stronger version of the density and efficiency theorem. As hinted 
above, we must use an alternative version of the density and efficiency theorem in which we approximate 
any two-qubits gate on the subspace K = Hg^i^i, while controlling the behavior of the other subspaces 
of Hg, so that the operator there has the same, or smaller norm as in K. 

Theorem 10.3 (The improved density and efficiency theorem) Let g = 3 and W be a corre- 
sponding set of non-unitary weights (either real or complex, as given in Definitions \ 7. 9\ 7.10^ . Then 
there exists a classical algorithm that takes a two-qubits gate U (U Cz SU{4) in the complex case and 
U G 5*0(4) in the real case), and a number e > and outputs the description of a T Cz GTL{d) with 8 
in-pegs and 8 out-pegs such that 

||[/-A^V(T)|L, <e , (93) 

and in addition 

||A^V(T)|L. <l + e ■ (94) 
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Hg is the space of all 8-steps paths over the auxiliary graph of d ^ \/3 - not just those that start at 1. 
T is equal to a product of poly {log e~^) crossings with weights taken from W/d. The factor Aj- is the 
absolute value of the product of all the odd weights v/d that appear in T. Finally, the running time of 
the algorithm is poly (log e~^). 

Proof: The proof is given in Appendix O □ 



With this result, the proof of Theorem 110.21 is fairly straightforwards: 
Proof of Theorem [TU1[ 

It is easy to sec that the problem is in BQP. Indeed, we can use the proof of the quantum algorithm in 
Sec. l6.2l with the only difference that now we use the quantum computer to implement the operators p{Sj) 
of grouped elements Sj instead of the individual cups/caps and crossings. This is possible because we 
are guaranteed that each p{Sj) works only on the tensor product of polylogarithmically many registers. 
Therefore all the classical computations that we need to do on p{Sj), such as computing its norm, or its 
polar decomposition, can be done efficiently. 

Let us now prove that the problem is BQP-hard. The proof follows closely the proof of the universality 
theorem 17.11 (see Sec. 17. 5p . We are given a description of a quantum circuit over n qubits as a product 
of N two-qubits gate U = Un ■■■■■ Ui, and are asked to decide whether | (O^""] U |0'»") P < 1/3 or 
I (0^"| U\0^") p > 2/3. The only difference is that now we use the improved density and efficiency 
theorem 110.31 to approximate every two-qubits gate Ui. The result is a graph G which, according to 
claim 17.31 satisfies 

^ . ^ . . ^ , ^ ^^^^^ ^ ^g^^ 



^ -ZG(g,v)-(0®"|C/|0®") 



Afiard 

As previously mentioned, G is endowed with a natural grouping that is provided by the improved density 
and efficiency theorem: the crossings that approximate a single two-qubits gate are grouped together, 
while every cup/cap element has a group of its own. This is a legitimate grouping of G since every 
product of crossing acts on the tensor product of 9 local registers (8 strands). We will prove that the 
problem is BQP-hard by showing that for this particular grouping, Ahard — Agrp up to an overall factor 
of the order one. 

Let Sj be the product of elementary crossings that corresponds to the gate Ui. Following the universal- 
ity proof from Sec. l7.5[ it is created by using the improved density and efficiency theorem with e = 1/lOOA^. 
Consequently, we are promised that ~ A^^. up to a factor of order 1 ± I/IOOA^. The crucial 

point here is that the above condition is casted in terms of the Hg norm rather than the, weaker, Lg 
norm. is the space of all 8-stcps paths on the auxiliary graph of d = \/3, hence 1 1/5(5; )||^f = ||/9(5j)||. 
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Consequently, 

N N 

Y[\\p{S,)\\ c,l[As,= n l^^/^l' (96) 

with ~ indicating equality up to a factor of order one, and Eodd d E is the set of edges with odd weights 
in G. 

Lastly, we note that for 9 = 3, we have d = 2cos7r/6, therefore the path representation is Hcrmitian 
(see Sec. 15. 8p . In such case, according to Lemma f 10.11 the norm of every cup/cap operator is exactly 
(^1/2 = q^/^. Multiplying all these norms together, we get 

and as \V\ — 2n + \Eodd\, we conclude that 

Agrp 0, q\y\-\Eodd ( Yl \vA = Ahard . (98) 
\e<£Eadd I 

□ 
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A Tools for Building up the Dimensionality 
A.l Proof of the non-unitary Bridge lemma (Lemma I9.3p 

The non-unitary Bridge lemma can be proved with the aid of 4 smaller lemmas. For brevity, wc do not 
write which of the cases F = (D or F = R we treat since the proof is the same in both cases. 

Lemma A.l (Lemma A) Let C = A® B with dimS > dim A, and let W G SL{C). Then there exists 
a vector b <E B such that W \b) e B. 

Proof: Consider the subspace WB. If WB n B = then dim{WB ® B) = dim{WB) + dim(B) = 
2 dim{B) > dim C, which is a contradiction. □ 

Lemma A. 2 (Lemma B) Under the same conditions of the bridge lemma, it is possible to generate a 
transformation T^,^ e SL{C) that would take the vector \tp) to the vector \(f>) . 
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Proof: For each vector G C we will generate a transformation T.^h where \b) e S is some fixed vector. 
This will prove the lemma since we can define — T^^T^t,- 

• The dim A ^ 1 case 

Let A be spanned by the vector \a). As WB ^ B, wc pick \b) S B such that W \b) — a \a) + (3 \b') 
with a 7^ 0. By virtue of Lemma A we can also assume that ^ for we can always add to \(3) a 
vector from B whose image under W is inside B. 

Let = ofQ \o) + l^o); a-iid assume for a start that ao 7^ and /3o 7^ 0. Then we perform an 
SL{B) transformation that takes |6o) ^ ^ |^')- We get |V'i) = 77 (" !«> +/? I^>)- Acting with ly-^ 
we obtain \'4)2) ~ 77 \b) which can be then scaled to \b) using yet another SL{B) transformation. 

If ao = then e B and we can simply move it to \b). If /3o ~ then |?/') G A and so 1^ 
must have some projection on B, and we return to the previous cases. 

• The dim A> 1 case 

As in the first case, we pick pick \b) G B such that W \b) ~ a \a') + (3 \b') with a 7^ and /? 7^ 0. 

Let = ao \a) + /3o 1^0)7 a-nd assume for a start that ao 7^ and f3o 7^ 0. Then we perform an 
SL{B) transformation that takes |6o) ^ ^^^^ transformation that takes |ao) to 

^ \a). We get jV;!) = (a |a) + = Then Acting with W'^ we obtain 

If ao = is handled as in the dim A = 1 case. In the /3o = case, £ A and hence using a SL{A) 
transformation it can be moved to another vector in A whose image under W has some projection 
on B, after which we proceed as before. 

□ 

Lemma A. 3 (Lemma C) Under the same conditions of the bridge lemma, there exists a basis (not 
necessarily orthogonal) {\ai) , . . . , |am)} of A, a basis , . • . , \bn)} of B, and a transformation V G 
SL[C) that we can generate, which is defined as follows: V \ai) ~ \bi) , V \bi) = — \ai) and V is leaves 
the rest of the bases elements unchanged 

Proof: 

Pick any vector \ai) G A and a vector \b') e B. Then by Lemma B there exists a transformation 
T that takes |ai) to |6'). Use lemma A with the transformation T to find vectors , \b") E B such 
that T \bi) = \b"). Obviously \b") 7^ |5') and therefore they span a subspace of B with dimension 2. 
Denote by S the orthogonal complementary of this subspace in C. Then dimS* = dimyl + dimS — 2. We 
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now construct a transformation U E SL{B) such that U \b') = \b") and U \b") = — \b'), while leaving S 
unchanged. 

Now consider the transformation V = T^^UT. It has the following properties: 

• V\ai) = \bi) 

• =-|ai) 

• V leaves the space R =^ T^^S unchanged. This is because for every |s) G 5", we have 

VT-^ \s) = T-^UTT-^ \s) = T-^U |s> = T'^ \s) . (99) 

y is a non-singular transformation hence dim R = dim S = dim A + dim B — 2. Moreover, as \ai) , |&i) 
are not in R then C = span{R, \ai) , \bi)}. It follows that dim(i? O A) = dim A — 1. Indeed if dim(i? n 
A) > diniA then R must include A, contradicting the fact that |ai) ^ R. On the other hand, if 
dim(i?n A) < dim A — 1 then A must contains a vector other than \ai) which is not in R - contradicting 
the fact that C = span{R, \ai) , Similarly, we may prove that dim(i?n B) = dim_B — 1. 

Finally, we choose a basis {\a2) , ■ ■ ■ , \am)} for i? n A and a basis {I62) , • • • , \bn)} for ROB. Then 
{|ai) , . . . , \am)} is a basis of A and {\bi) , . . . , |6ti)} is a basis of B as required. □ 

Lemma A. 4 (Lemma D) Let B be a linear space with dim_B > 1, and let \a) be a vector outside of 
B and define the space C = B (B span{\a)}. Let W € SL{C) be a transformation that mixes B with 
span{\a)}. Then using W and transformations in SL{B) (which do not affect the \a) vector), we can 
generate SL{C). 

Proof: Let y be a transformation in SL(C). Define = V \a). Then according to Lemma B we can 
generate a transformation T that also takes \a) to 

Consider now the transformation X = T^^V . By generating it we will prove the lemma. X leaves 
\a) invariant. If, in addition, X G SL{B) then we are done. Assume then that X ^ SL{B), and pick a 
basis \bi) , . . . , |6„) of B. The action of A on i? can be described by 

A|6,) -^Ar|6,)+x.|a) (100) 
i 

Here A'' is a n x n matrix. Since det A = 1, and X \ a) = |a), it follows that det A'' = 1. Also notice that 
as XB B then at least of Xi must be non-zero. 

We now wish to generate a transformation Y that similarly to A, leaves |a) invariant while YB 7^ B. 
Let T be a transformation that takes \a) \bi), and consider transformations Y of the form Y = T^^UT 
where U is any transformation in SL{B) that leaves |6i) invariant. Then Y \a) = \a). We claim that there 
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must he a. U that creates Y such that YB ^ B. Indeed if YB = B then UTB = TB, i.e., U preserves 
the subspace TB. However, this is a contradiction since ^ TB and at the same time by Lemma A 
there are \b') , |6") G B such that f\b') = \b"). So we can construct a U that takes |6") \b") + \bi) 
(which is outside TB) while leaving \bi) invariant. 
We conclude that 

Y\b.)=Y.Y:^\b,)+y,\a) (101) 
i 

with Y^ being an n x n matrix with det — 1 and at least one yi is non-zero. 

Let N E SL{B) be the transformation that in the {\bi) , . . . , basis is given by Nl^ such that 

Y,Nl,y,=x,. (102) 

Similarly let M e SL{B) be such that 

Then it is easy to see that MYN = X. We have thus generated X. □ 

We are now in position to prove the Bridge lemma: 
Proof: Using lemma C we generate a transformation Vi that mixes a vector |ai) G A with B while 
leaving the rest of A unchanged. We then use lemma D to generate SL{B span{\ai)}) . Repeating this 
process we add more and more vectors from A until we generate all SL{A © B). Finally, a quick glance 
at the proof of Lemmas A-D reveals that in each one of them we used a finite number of transformations. 
Therefore every transformation in SL{A B) can be represented as a finite product of transformations 
from SL{A),SL{B) and the bridge W. □ 

A. 2 Proof of the non-unitary Decoupling lemma (Lemma 19.41) 

Consider the subgroups Ha of SL{A) and Hb of SL{B) which is defined by 

Ha = {X e SL{A)\3{gr,} € G s.t. pA{gn) ^ X and peign) ^ 1} , (104) 
Hb =' {XeSL{B)\3{gn}eGs.t. pBign)^X andpAign)-^!} (105) 

We would like to prove that if dim A ^ dim B then Ha = SL{A) and Hb = SL{B) (complete decoupling), 
whereas if dim A = dimB then either we have a complete decoupling, or Ha is in the center of SL{A) 
and Hb is in the center of Hb- 

We first notice that Ha and Hb are normal closed subgroups of SL{A) and SL{B) respectively. The 
proof is straight forwards, and follows exactly a similar claim in the unitary decoupling lemma. 
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We now use fact that SL{A) and SL{B) are "almost simple Lie groups", both for F = ]R and for 
_F = (D, which are also connected groups. Then every normal closed subgroup of them is either the whole 
group or is in their center. It is easy to see that if one of them is the whole master group then so must 
be the other. In such case the lemma is proved. Let us therefore assume that both subgroups are in the 
center of their master groups. We would like to show that in such case we must have dim A = dimS. 

We now define a homomorphism M : SL{A)/Ha SL{B)/Hb- For every coset VaHa in SL{A) / Ha 
we find a series {g,i} in G such that pA{gn) converges to some element in VaHa and psign) converges to 
some Vb <E SL{B). The existence of such series is promised by the conditions of the lemm4j Following the 
same argument of the unitary decoupling lemma we prove that M is a continuous 1 — 1 homomorphism 
between SL{A)/Ha and SL{B)/Hb- But these c^uotient groups are also Lie groups which can also be 
viewed as smooth manifolds. The isomorphism is thus turned into diffeomorphism between two manifolds, 
which can only happen if they have the same dimensions. But Avai S L{A) / H a = AmiSL{A) because Ha 
is finite, and similarly dim SL{B)/ Hb = dim SL{B). Therefore dimSL{A) = dimSL{B) which gives us 
dim A = dimB. 



B Variants of the Solovay-Kitaev algorithm 

We now prove Theorem 17.61 We follow very closely the derivation of Dawson & Nielsen in Ref [35] . 

We are given an eo-net over Bji{M), whose members are transformations in SL{M, (D) (the complex 
case) or S'L(Af, R) (the real case). The size of eo depends on m dim A/ and on R. We will not give 
an explicit formula for it, but instead assume that is small enough (but finite!). By a close inspection of 
the derivation, one can easily find the desired size of eq- 

The SK algorithm is basically a recursive routine that receives a transformation V (not necessary 
unitary - but inside Bji{M)), and an integer n, and returns Vn - the n'th order approximation for V. 
The distance between Vn and V is smaller than e„ c^^(ceo)'-'^/^-'" , for some constant c < I/eq, thereby 
providing a super-exponential convergence. 

The main idea of the algorithm is unchanged. The goal is to find an e„-approximation for A VVn^i ■ 
This is done by finding a product of commutators that approximate A to a factor of O {e^J^i ) , while making 

1/2 

sure that the matrices in the commutators are at most 0{ej^_i) away from unity. This approximation, 
however, is not given in terms of the generators. Therefore in the last step, we find a e„_i approximation 
to the every matrix that appears in the commutators in terms of the generators, and use the following 

•^Notice that this is essential the only difference between the unitary and non-unitary decoupling lemmas. In the unitary 
case, the existence of such converging series is always promised due to the compactness of the unitary groups. We do not 
know if this is so in the non-unitary case, hence we used the extra condition. 
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general property of the group commutator: if y, W are are an e approximation for V, W , and in addition 
the distance of V, W from unity is 5, then 

lV,Wi~lV,W}\\=OieS) . (106) 

(see Lemma lB.2p . This allows us to give an approximation of A in terms of the generators. 

The overall result is that we find Vn such that — < ■ is a product of 13" generators, 

and constructing it takes a similar number of steps. Yet the super-exponential convergence provides the 
promised asymptomatic. This asymptotic is probably not an optimal one, yet it is simple to derive and 
suffice for our purpose. 

We begin by presenting a pseudo-code version of the algorithm, followed by an analysis of its routines. 
Most of them arc the same for the complex and real cases, except for the GC-Unitary-Approx routine, 
for which we supply a different version for each case. 



B.l A pseudo-code of the algorithm 

function SK( Gate V e Br{M) , depth n) 
if (n 0): 

Return Basic-Approxiination(y) 

else 

Set Vn = SK(V,n~ 1) 

Set A = VV-_\ 

Set A,P= Polar-Decomp(A) 

Set Va,Wa = GC-Unitary-Approx(A) 

Set Vo,Wo,Ve,We = GC-Hermitian-Approx(yl) 

Set V^,W^, y;, W;,, = SUVa, Wa, Vo, Wo, Ve, We, 71-1) 

Set A' - W'J ■ K, W^j ■ iV:, M/^l 

Return A'K-i 



B.2 The Basic-Approximation (y) function 

This function returns the zero approximation to y G Bfj{M). This is a single generator whose distance 
from V is smaller than eg. Such a generator must exist by the condition of the ep-nct. As this net is 
finite, the search can be done in a finite time with a naive search. 

The necessity of the eo-net to be finite is the reason why we limit ourselves for transformation in 
Bji{AI) instead of the full group SL{M, (D) (or SL{M,]R)), whose volume is infinite. 
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B.3 The Polar-Decomp(A) function 

The Polar-Decomp(A) routine gets a general invertible matrix A with ||A — 1|| < e„_i and returns its 
polar-decomposition 

A = AP , (107) 

with ||j4 — 1||, ||P — 1|| = 0{en-i)- The polar-decomposition is a standard procedure which can be done 
efficiently. In the complex case, ^ is a unitary matrix and P is a positive-definite Hermitian matrix. In 
the real case, A is an orthogonal matrix while P is a real positive-definite matrix. 

The next lemma promises us that if A is close enough to 1, then also P and A will be close to 1. 

Lemma B.l For small enough e, if ||A — 1|| < e and A = AP with det A = 1 then \\A — 1\\ < Ce and 
||P — 1|| < C'e for some constant C which only depends on m. 

Proof: Since ^ is unitary (or orthogonal) then I |P| I = ||A||. But ||A|| = ||A-1+1|| < ||A-1|| + 1 < 1+e. 
Therefore ||P|| < 1 + e. 

P is positive-definite Hermitian matrix and is therefore diagonalizable with eigenvalues ri > r2 > 
• ■ • > ?'m > 0. In addition, as det P = 1, we have Ili ^ 1- It follows that | |P| | = ri < 1 + e. But since 
JI^Ti = 1 it also follows that > (1 + e)^(™~i) > 1 — Cie, with C" being a constant that depends only 
on TO. Combining these two results, we find that there are two m-dependent constants Ci, C2 such that 
||P — 1|| < Cie and ||P^^ ~ 1|| < C2e. Finally, as ^ = AP~^, we there is an TO-dependent constant C3 
such that I A — 1|| < 6*36. Taking C = max(Ci, C2, C3) proves the lemma. □ 

B.4 The GC-Unitary-Approx(A) routine (complex case) 

This routine receives a special- unitary matrix A with — 1|| < e, and (for epsilon small enough) returns 
two special- unitary matrices Va,Wa such that - 1||, ||Va - 1|| < Cie^/^ and - |Vk,M^Al|| < 

C2e^^^- Note that as Va,Wa have unit determinant, then for small enough e, they are inside Bfi{M). 
Again, this condition is guaranteed to hold provided eg is small enough. 
The description of this function is given in [38] . 

B.5 The GC-Unitary-Approx(yl) routine (real case) 

The real case is similar to the complex case, only that here A is a real orthogonal matrix, and resulting 
matrices Va, Wa are real matrices (in order to have them in B{M, R)) - though not orthogonal. 
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Consider then an orthogonal matrix A with ||^ — 1|| < e. It is well-known that A admits a canonical 
form in which it is block-diagonal with 2x2 blocks of the form 



0(9) 



(108) 



cos y sm a 
— sin 6 cos 6 

and an optional diagonal term of 1 (when dim AI is odd) . . The canonical basis is connected to the 
standard basis of A via an orthogonal transformation, hence we will assume without loss of generality 
that A is already given in its canonical form. We will now work in every block separately. 

The one-dimensional block, if exists, is trivially taken care of by also setting Va, Wa to be 1. In the 
2x2 blocks, we first work in the diagonalizing basis using the transformation 

1 i 



V2 



which gives us 



S''O{0)S = 







1 



exp 



i0 
-i6 



dof ^iH 



(109) 



(110) 



Notice that the condition \\0{9) - 1|| = 0{e) implies \\H\\ = \e\ = 0{e). 
Consider the two traceless and Hermitian matrices 



1 

1 



1 

-1 



(111) 



It is easy to verify that [F, G] = iH and that | |F| |, | |G| | = 0{^/\\H\\) = 0{e^/^). Here [F, G] =^ FG-GF 



dcf 



dcf 



is the Lie commutator, not to be confused with the group commutator. Defining = e and W'j, 
it is easy to see that — \\W'j^ — 1|| = 0{e^^^). We now use a general exponential expansion to 
deduce that for every matrices A,B with ||^||, ||i3|| < 6, necessarily. 



(112) 



for some c w 4 . This implies that 

Finally, going back to the original basis, we define 

Va = SV^S^ = e^^^' , and Wa SW^^S^ = e^«^' , 

which gives 

\0{e)-lVA,WAl\\^0{e^/^) . 

It is easy to verify that SFS'' = F, while SGS'' = dia.g{./ej2, - ^/e/2). Therefore Va,Wa are in 
SL{M, R) as required. 



(113) 

(114) 
(115) 
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B.6 The GC-Hermitian-Approx(P) routine 



This routine gets an Hermitian matrix P with ||P — 1|| < e and returns four matrices Vo,Wo,Ve,We G 
Bji{M), whose distance from 1 is 0(e^/^), and 



lVo,Wo\-lV,,W,\-P =0(e3/2) 



(116) 



Its structure is similar to the structure of the GC-Unitary-Approx routine in the real case, only that 
now we work with a positive-definite, Hermitian matrix P (in the real case P is symmetric). 

We assume without loss of generality that P is given in its diagonal form P = diag(ri, . . . ,r,„) with 
1^ ^2 > ■ ■ ■ > Tm > and ri • r2 • . . . ■ r„i = 1 . This is because the diagonalizing matrix is either unitary 
(in the complex case) or is orthogonal (in the real case). 

As in the previous routine, we would like to work with pairs of eigenvalues with opposite signs. 
We do that by decomposing P into a product of two diagonal matrices P ~ Pq ■ Pe where Po = 
diag(Ai, 1/Ai, A3, 1/A3, . . .) (odd indices) and P^ = diag(l, A2, 1/A2, A4, 1/A4, . . .) (even indices). It is 
easy to verify from the condition ||P - 1|| < e that \\Po - l\\,\\Pe - 1|| = C(e)- 

Consider now a (A, 1/A) block in either matrices. It can be written as with H = diag(6', —9), and 
= A. Using the same trick that we used in the previous routine, we may write H = [F, G] with 



F 



dcf 



1 

1 



1 

-1 



(117) 



By the same arguments that were given in the previous routine, V ^ e^ and W — e^ are both det = 1, 
real matrices, with \\V - 1|| = ©(e^/^), and 

A 



(118) 





1/A 

The fact that V, W are real matrices is crucial to the real case where we must work with matrices from 
SL{M, R). 

Performing the above decomposition for every 2x2 block in Pq, P^, it follows that we can find matrices 
Vo, Wo, Ve, Ve in Bji{M) whose distance from the unity is 0{e^^^) and which satisfy Eq. (|116p . 



B.7 Approximating the commutators in the main routine 

The distance between the output of the main routine and V is ||A'y„_i — = ||A'V^_i — AT^„_i|| < 
(i? + 1)||A' - A||. The last equality follows from the fact that K-i e Be.{M), hence < R+l- 

Let us now estimate ||A' — A||. We use the following result that generalizes Lemma 1 in page 9 of 
Ref [35]: 
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Lemma B.2 LetV,W,V,W be four matrices such that \\V-V\\,\\W-W\\ < e and < 6. 

Then (for e,6 small enough) there exists a constant C such that 



iv,wi-iv,wi 



< CeS . 



(119) 



Proof: 

We expand F, in e by 



V 
W 



V + eA + 0{e^) , 
W + eB + 0{e^) . 



Then it is easy to verify that 



Plugging this into VWV^^W~^ we obtain 

VWV-^W'^ = VWV'^W^^ +eAWV-^W-^ +eVBV-^W-^ 
~ eVWV-^AV-^W-^ - eVWV-^W-^^BW-^ + 



(120) 
(121) 

(122) 
(123) 

(124) 
(125) 



We wish to bound the norm of the terms hnear in e. There are four such terms. Consider the terms that 
involve A (other two terms are treated similarly). As are bounded from above, we get: 

WAWV^W-^ - WV~^ AV-^W^W < C\\AWV-^ - WV-^AV^W . (126) 

Now expand = 1 + 5D and plug it in the RHS of the above equation. We get 

||A(1 + 5D)V^^ - (1 + 5D)V-^AV-^\\ ^ \\AV-^ - V-^AV'^ + 0{5)\\ (127) 



< 111 - V- 



\V-'\\+0{5) . 



(128) 



However, by virtue of our e-expansion, ||A|| = 0(1) and ||V^~^|| is also bounded since ||T^— 1|| < 5. 
Therefore, we are left with H^^^ — 1|| which is 0{5). We have thus shown that lowest non-trivial order 
is t5. 



□ 



With the last Lemma, we are able to show that 1 1 A' — A| | < Ce^J^i- Indeed by the previous reasoning 
we have A = [F^, Wa} ■ Wo, Wo\ ■ [14, W,}, and A' = W^Al ' W^i ' W'el with aU the unprimed 

1/2 

matrices within distance C'(e,/_]^) from 1. Also, by the recursive application of the SK routine, the distance 
of the primed matrices from the unprimed matrices is 0{en-i)- Using the last lemma, we get 



(129) 
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and similarly 

Therefore ||A' — A|| = 0{e^/^^). A careful examination of the previous steps (which we omitted for sake 
of clarity) reveals that the statement can be written as 



|A'-A||<Cel'i , 



(131) 



with C being some constant that depends on m and R. 



B.8 Verifying the asymptotic 

We will now prove by induction that \\Vn — < '= c^^(ceo)'-'^^^-'", for some constant c, and that Vn 
is a product of 11" generators. 

For n = the statement is trivially true. Assume that it is true for n — 1 and consider the n'th case. 
Using the result of the last section, we have 

3/2 



(132) 



UK - V\\ < CeT., = G Y\ce,tl^y'~'\" = Ccr^l\ 

Therefore choosing c = and making sure that ceg < 1 gives the desired result. 

Finally, if Ki-i, W^A' Kjj K'i ^^"^ ^ products of 13"~^ generators then it is clear from the 
formula of that it contains 13" generators. 

This completes the proof of the non-unitary Solovey-Kitaev theorem. 



C Proof of improved 8-strands density and efficiency theorem 
(Theorem [Tim 

We first list the different subspaces that make up i7| when q = 3. In such case d = \/3 = 2cos7r/6, and 
we have an Hermitian representation that is defined using a finite auxiliary graph with 5 sites. Indeed, 
the first 6 entries of the general eigenvalue that is described in Eqs. (|29ll33p are tt = (1, V3, 2, a/3, 1,0,.. .). 
Therefore iJg has a finite dimension, and can write down its decomposition to the subspaces Hg^k-^i with 
k,£ G {1, 2, 3, 4, 5} and even |fc — ^|. An easy calculation shows that there are exactly 13 such subspaces, 
with a total dimensionality of 378. They are fully listed in Tabled 

Because of the fact that tt.; ~ T^e-i, it is easy to see that the spaces H^^i^j are isomorphic to Hg^Q^i^Q-j 
with respect to the operators, and consequently the crossings operators. Moreover, under the trans- 
formation |pi,p2, • • • iPg) > |6 — pi, 6 — p2, . . . , 6 — pg), these operators retain the same matrix form. In 
addition we can prove that the spaces Hg,^i^j arc isomorphic to H^.j^i- 
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dim 


Subspaces 


14 


Hs,!^!, -^8,5^5 


13 


H8,l-f5, -^8,5— 1 


27 


^^8,1— 3, -^8,3^1, ^^8,3^5, ^^8,5^3 


40 


H8,2^4, Hs,4:-t2 


41 


H8,2-t2, -^8,4^4 


54 


^^8,3^3 



Table 2: The list of subspaces that make up the full 8-steps space for g = 3. Subspaces with the same 
dimensionality are isomorphic to each other with respect to the path representation. 

It follows that all subspaces with the same dimensionality are isomorphic to each other. They are 
therefore grouped in same line in Table [21 

The first step of the proof is to show that in every subspacc H^^k^t, the normalized crossing operators 
(Ti, . . . , (Ty create a dense subgroup, either in SL[H8.k^i^ C) or in SL{H8^k—>e, IR)- Notice that according 
to Lemma XXX, we need not care about the actual normalization of these operators, because in the 
end we always use the commutator group, which cancels them out. In addition, due to the isomorphism 
between subspaces of equal dimensionality, it is enough to prove density over six subspaces of different 
dimensionality. 

To prove density in a general H^.k^t, we follow the same steps we took in the density proof of Hg^i^i. 
We first establish an SL{2, (D) or SL{2, R) density on a two-dimensional subspaces (the seeding), and then 
build up density on larger and larger subspaces using the non-unitary Bridge and Decoupling lemmas. 
The trickiest part of the proof is the first part, the seeding. We need to find a two dimensional subspace 
and a pair of operators ^i+i which operate inside that subspace. Then the corresponding crossing 
operators will also operate inside that subspace. In the Hg.i^i case this was done by finding a path 
in Fig. I10[l which is a non-trivial eigenvector of some $i ((f>i in the H^.i^i case). This vector is mixed 
with another vector {\p2) in Fig. fTU]) by either <I>i+i or while at the same time, the other vector is 

sent to zero under the action of $i. 

Such non-trivial eigenvector can always be found if H^^k^i contains "zigzag" paths that "hit the 
ceiling or the floor". More precisely, we are looking for paths that contain a passage like 1 ^ 2 ^ 1 
or 5 — > 4 5. When this happens at the i, i + I steps, then such a path is an eigenvalue of <E>i with 
eigenvalue d. The reason is that when <I>i acts on a path \. ..£,£+ 1,£ .. .), or |. 1, ^ .. .), it will 

transform it to a linear combinations of |. . . ^, ^ ± 1, ^ . . .). But when £ is the first or last site (£ = 1 or 
£ = 5 respectively), one of the eigenvectors does not exists, hence the other becomes an eigenvector. 
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It is easy to see that such "zigzag" paths must exist in ah of the subspaces of Table [5] because we are 
deahng with 8-stcps paths over a graph with only 5 vertices; no matter where we start, we can always 
reach the one end and bounce back. Finally, it is also easy to check that the two-dimensional matrices 
always have the same form - independently of the subspace that we work in - and therefore the density 
criteria that is given in Definition XXX is valid also here. 

The next step is the use density on the two-dimensional space and prove density over larger and larger 
spaces using the Bridge lemma and the decoupling lemma. This step can probably be done generally 
for a large class of subspaces, using some kind of an induction argument. However, as we are interested 
solely in q = 3, we opted for a brute- force solution. We designed a simple computer code that found a 
series of moves consisting of an altering application of the Bridge and Decoupling lemmas that establish 
density in each subspace. Not surprisingly, the details of the derivation in each subspace is far too long 
and tedious to be given here. 

After we established density on every subspace, we can use the non- unitary Decoupling Lemma 19.41 
to prove that spaces with different dimensionality decouple. In other words, we can approximate any 
operator on V on using the normalized crossing operators provided that: 1) is block-diagonal on 
the subspaces of Table and 2) blocks with the same dimensionality are coupled to each other by the 
similarity transformation that defines the isomorphism of these subspaces. 

We now wish to use the Solovey-Kitaev theorem to generate such approximations efficiently. Looking 
at the proof of the non- unitary Solovey-Kitaev theorem that is given in Appendix |B1 shows that it is easy 
to generalize it to a direct sum of spaces. More precisely, the generalization goes as follows: we define 
Ml, . . . Mk to be finite subspaces, and look at M — Mi © . . . ©M^. We wish to approximate any operator 
in Bfj{M), which is the set of transformations V such that: 1) V is block diagonal on M according to 
the Mi blocks, 2) at every block it belongs to SL{M,,€) (or S'L(Mj,R)), and 3) \\V - 1\\ < R. This 
is done using a set of generators that create an eo-net over Bii{M) for some eq that depends on the 
dimensionality of the various Mi spaces, as well as on R. 

The fact that we establish density, independently, over all the subspaces of with different dimen- 
sionality, means that we can use the generalized Solovey-Kitaev for a direct sum of spaces with different 
dimensionality. In other words, we take Mi, . . . , Afg to be representatives from each dimensionality class 
in TableO with Mi = Hs^i^i- We wiU use the generahzcd Solovey-Kitaev on M = Mi © ... © Mg. 

Consider a two-qubits gate U and its encoded version U_ which operates on Hg^i^i. We can extend 
[/ to be a block operator on M by setting to 1 over all the other five blocks. Then we use the extended 
Solovey-Kitaev theorem to find an e-approximation V to UnU for some e > 0, using the normalized 
crossing operators. How does V behaves on the subspaces which arc included in M? By virtue of the 
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isomorphism between these spaces to their representative in M, we conclude that V is either conjugate to 
a e-approximation of the unity matrix or to a e-approximation of U_. In both cases, since the conjugation is 
done by a unitary matrix, the norm of V is bounded by 1 + e. To summarize, we found an approximation 
V in terms of the normahzcd crossing operators, such that 

\\V-U\\Hs^^^^<e, (133) 

and 

\\V\\H^<l + e. (134) 

Lastly, we write the approximation V in terms of the un-normalizcd crossing operators, and wc recover 
the GTL{d) element T such that V = A^^p{T). Substituting this in the two equations above, and using 
the fact that || ■ Uls < || ■ Whs i^i proves the theorem. 
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